🐛 [Bug] Error when compiling aten model with intermediate int64 tensors #1864
Bug Description
When compiling the T5-Base model via the `aten` path, the following error is encountered:
Since none of the input tensors have type `int64`, it is presumed that some intermediate tensor encountered during partitioning takes an `int64` input; such tensors are generally associated with indices in Torch.
To Reproduce
Steps to reproduce the behavior:
1. Initialize the model: `T5Model.from_pretrained("t5-base").eval().cuda()`
2. Initialize three input tensors, one for each of `("input_ids", "attention_mask", "decoder_input_ids")`, for example: `torch.randint(0, 1, (1, 14), dtype=torch.int32).to("cuda")`
3. Use the `transformers` tools to trace the model via: `transformers.utils.fx.symbolic_trace(model, input_names=["input_ids", "attention_mask", "decoder_input_ids"])`
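The steps above can be collected into a single script. This is a minimal sketch, not the exact script used when the error was observed: the helper names `make_inputs` and `trace_t5` and the constant `INPUT_NAMES` are hypothetical, and the sketch assumes a CUDA device and that the `t5-base` weights can be downloaded. It stops after tracing, since the compile call itself is not listed in the steps.

```python
import torch

# Input names taken from the reproduction steps.
INPUT_NAMES = ["input_ids", "attention_mask", "decoder_input_ids"]


def make_inputs(device="cuda"):
    """Build one int32 tensor of shape (1, 14) per model input (hypothetical helper)."""
    # The upper bound of torch.randint is exclusive, so randint(0, 1, ...)
    # produces tensors filled with zeros.
    return {
        name: torch.randint(0, 1, (1, 14), dtype=torch.int32).to(device)
        for name in INPUT_NAMES
    }


def trace_t5():
    """Load T5-Base on the GPU and trace it with the transformers FX utilities."""
    # Imported lazily so make_inputs can be used without transformers installed.
    from transformers import T5Model
    from transformers.utils.fx import symbolic_trace

    model = T5Model.from_pretrained("t5-base").eval().cuda()
    return symbolic_trace(model, input_names=INPUT_NAMES)


if __name__ == "__main__":
    traced = trace_t5()
    inputs = make_inputs("cuda")
    # The compile step that triggers the error would follow here; it is
    # omitted because the exact call is not given in the steps above.
```

Building the inputs as a name-to-tensor dictionary keeps them aligned with the `input_names` passed to the tracer.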
Expected behavior
The model should compile via the `aten` path.
Environment
python setup.py develop