🐛 [Bug] Error when compiling `aten` model with intermediate `int64` tensors #1864

gs-olive · 2023-04-27T20:09:56Z

Bug Description

When compiling the T5-Base Model model via the aten path, the following error is encountered:

  File "~/TensorRT/py/torch_tensorrt/fx/fx2trt.py", line 303, in placeholder
    name=target, shape=tuple(shape), dtype=torch_dtype_to_trt(dtype)
  File "~/TensorRT/py/torch_tensorrt/fx/utils.py", line 45, in torch_dtype_to_trt
    raise TypeError("%s is not supported by tensorrt" % dtype)
TypeError: torch.int64 is not supported by tensorrt

Since none of the input tensors have type int64, it is presumed that some intermediate tensor encountered during partitioning takes an int64 input, which are generally associated with indices in Torch.

To Reproduce

Steps to reproduce the behavior:

Initialize model: T5Model.from_pretrained("t5-base").eval().cuda()
Initialize three input tensors, for example: torch.randint(0, 1, (1, 14), dtype=torch.int32).to("cuda") ("input_ids", "attention_mask", "decoder_input_ids")
(Optional) Use the transformers tools to trace the model via: transformers.utils.fx.symbolic_trace(model, input_names=["input_ids", "attention_mask", "decoder_input_ids"])
Compile the model using FX

Expected behavior

Model should compile via the aten path

Environment

Transformers: 4.26.1
Torch-TensorRT Version (e.g. 1.0.0): b3f433a
PyTorch Version (e.g. 1.0): 2.1.0.dev20230419+cu117
CPU Architecture: Intel Xeon CPU
OS: Ubuntu 20.04
How you installed PyTorch: pip
Build command you used: python setup.py develop
Are you using local sources or building from archives: local
Python version: 3.8.13
CUDA version: 11.7

The text was updated successfully, but these errors were encountered:

github-actions · 2023-07-27T00:02:13Z

This issue has not seen activity for 90 days, Remove stale label or comment or this will be closed in 10 days

gs-olive · 2023-08-01T17:46:30Z

Addressed in Dynamo path via #1983

gs-olive added the bug Something isn't working label Apr 27, 2023

gs-olive self-assigned this Apr 27, 2023

gs-olive mentioned this issue Apr 27, 2023

fix: Add support for truncate_long_and_double in FX #1865

Closed

gs-olive mentioned this issue May 31, 2023

✨[Feature] Add support for truncate_long_and_double in Dynamo compile #1964

Closed

github-actions bot added the No Activity label Jul 27, 2023

gs-olive closed this as completed Aug 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🐛 [Bug] Error when compiling `aten` model with intermediate `int64` tensors #1864

🐛 [Bug] Error when compiling `aten` model with intermediate `int64` tensors #1864

gs-olive commented Apr 27, 2023

github-actions bot commented Jul 27, 2023

gs-olive commented Aug 1, 2023

🐛 [Bug] Error when compiling aten model with intermediate int64 tensors #1864

🐛 [Bug] Error when compiling aten model with intermediate int64 tensors #1864

Comments

gs-olive commented Apr 27, 2023

Bug Description

To Reproduce

Expected behavior

Environment

github-actions bot commented Jul 27, 2023

gs-olive commented Aug 1, 2023

🐛 [Bug] Error when compiling `aten` model with intermediate `int64` tensors #1864

🐛 [Bug] Error when compiling `aten` model with intermediate `int64` tensors #1864