Bug description
Running inference of a transformers4rec model on Triton fails with RuntimeError: PyTorch execute failure: Expected Tensor but got GenericList.
Similar issues in the past were solved by updating the forward function (see, for example, triton-inference-server/server#3348). However, this solution is not straightforward for transformers4rec.
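For context, one way to read "updating the forward function" is to wrap the model so that what it returns is a flat tuple rather than a nested list or dict. The sketch below only illustrates that idea in plain Python. `flatten_outputs` and `FlatOutputWrapper` are hypothetical names, not part of transformers4rec, merlin.systems, or Triton, and the integers stand in for tensors. Whether this is enough for a traced transformers4rec model exported through merlin.systems is not confirmed by this report.

```python
def flatten_outputs(output):
    """Recursively flatten nested lists, tuples, and dicts into a flat tuple of leaves."""
    if isinstance(output, dict):
        # Visit dict values in insertion order so the output order is stable.
        children = list(output.values())
    elif isinstance(output, (list, tuple)):
        children = list(output)
    else:
        # Anything else (in practice, a tensor) is treated as a leaf.
        return (output,)
    flat = []
    for child in children:
        flat.extend(flatten_outputs(child))
    return tuple(flat)


class FlatOutputWrapper:
    """Hypothetical wrapper: calls the wrapped model and flattens its output.

    Assumption: in a real setup this would subclass torch.nn.Module and define
    forward(), so that the wrapper (not the original model) is what gets traced.
    """

    def __init__(self, model):
        self.model = model

    def __call__(self, *args, **kwargs):
        return flatten_outputs(self.model(*args, **kwargs))


# Stand-in "model" that returns a nested structure; integers play the role of tensors.
def fake_model(x):
    return {"predictions": [x, x + 1], "labels": (x + 2,)}


wrapped = FlatOutputWrapper(fake_model)
print(wrapped(1))  # (1, 2, 3)
```

The difficulty described above is that the transformers4rec forward output is produced inside the library, so a wrapper like this would have to be applied before tracing and before the ensemble is exported.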
Steps/Code to reproduce bug
Model ensemble is generated using a traced model and merlin.systems.dag.Ensemble:
The model ensemble contains an NVTabular transformation workflow and a Transformers4Rec PyTorch prediction workflow, with the following structure (the list of categorical and continuous features is substantially larger than listed here):
instances.json example file:
1_predictpytorchtriton.
Expected behavior
Triton should return the prediction output.
Environment details
Additional context
@niraj06 @evagian