Closed
Bug Description
PR #3167 adds support for weight-stripped engines. It works for PythonTorchTensorRTModule but not for TorchTensorRTModule.
I observed the issue in this test:
TensorRT/tests/py/dynamo/models/test_weight_stripped_engine.py (lines 487 to 523 at commit 76bdf5e)
The CI test reports the error:
FAILED models/test_weight_stripped_engine.py::TestWeightStrippedEngine::test_two_TRTRuntime_in_refitting - AssertionError: False is not true : TorchTensorRTModule outputs don't match with the original model. Cosine sim score: 0.0 Threshold: 0.99
I printed refitted_output while using TorchTensorRTModule and it is all zeros, so it seems the refit did not succeed.
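For context on why the reported score is exactly 0.0, here is a minimal sketch of a cosine similarity check in plain Python. The `cosine_similarity` function and the sample values are hypothetical and written only for illustration; this is not the helper the test suite actually uses. It assumes the usual definition with a small `eps` guard on the denominator.

```python
import math


def cosine_similarity(a, b, eps=1e-8):
    """Cosine similarity of two equal-length vectors (illustrative helper)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    # Clamp the denominator so an all-zeros vector does not divide by zero.
    return dot / max(norm_a * norm_b, eps)


# Made-up stand-ins for the original model output and the refitted output.
reference = [0.5, -1.2, 3.0]
refitted_all_zeros = [0.0, 0.0, 0.0]

score_zero = cosine_similarity(refitted_all_zeros, reference)
score_same = cosine_similarity(reference, reference)

print(score_zero)  # the dot product with a zero vector is 0, so the score is 0.0
print(score_same)  # close to 1.0 for identical vectors
```

Under that definition, any all-zeros output scores 0.0 against any reference, which is consistent with the engine weights never being populated rather than with a small numerical mismatch.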