MPT: Change order of operands to enable PT2 compile for inference #559

tdoublep · 2023-08-25T17:54:48Z

This is a trivial change, which doesn't actually change any logic, but it very helpful to enable PyTorch 2 compile for inference using mpt-based models.

During compilation, evaluating attention_mask[:, 0].sum() != attention_mask.shape[0] seems to cause a bunch of problems because it compares the shape of the tensor to its actual contents, and leads to graph breaks. By changing the ordering of the operands in this if statement, the problem is resolved since self.training gets evaluated first and fails, thus preventing the rest from being evaluated.

dakinggg

LGTM, thanks!

tdoublep · 2023-08-25T21:08:47Z

Anything I can do regarding the failing checks? Not 100% sure but doesn't seem to be related to the proposed code change.

dakinggg · 2023-08-25T21:14:14Z

Its just the autoformatting, if you run pre-commit run --all-files and commit, it will fix it.

tdoublep · 2023-08-28T07:50:14Z

@dakinggg I ran the code formatting, but looks like an approval is needed for the checks to re-run.

Small change to enable PT2 compile for inference

b64bd89

dakinggg approved these changes Aug 25, 2023

View reviewed changes

code formatting

e9bceed

dakinggg merged commit a8c7dc4 into mosaicml:main Aug 28, 2023
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MPT: Change order of operands to enable PT2 compile for inference #559

MPT: Change order of operands to enable PT2 compile for inference #559

tdoublep commented Aug 25, 2023

dakinggg left a comment

tdoublep commented Aug 25, 2023

dakinggg commented Aug 25, 2023

tdoublep commented Aug 28, 2023

MPT: Change order of operands to enable PT2 compile for inference #559

MPT: Change order of operands to enable PT2 compile for inference #559

Conversation

tdoublep commented Aug 25, 2023

dakinggg left a comment

Choose a reason for hiding this comment

tdoublep commented Aug 25, 2023

dakinggg commented Aug 25, 2023

tdoublep commented Aug 28, 2023