Script MultiheadAttention #1524

cndn · 2019-12-19T00:37:40Z

Summary:
Make fairseq MultiheadAttention scriptable. Looking for feedbacks.

Add types
Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dict. As a result modules with multihead attention in it would assign itself fairseq_instance_id in the initializer.
There might be opportunities to make assertions and annotations cleaner.

Differential Revision: D18772594

facebook-github-bot · 2019-12-19T00:38:08Z

This pull request was exported from Phabricator. Differential Revision: D18772594

Summary: Pull Request resolved: facebookresearch/fairseq#1524 Make fairseq MultiheadAttention scriptable. Looking for feedbacks. 1. Add types 2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dict. As a result modules with multihead attention in it would assign itself fairseq_instance_id in the initializer. 3. There might be opportunities to make assertions and annotations cleaner. Differential Revision: D18772594 fbshipit-source-id: 81b830b16fbaa9c6fc34dee0672054f146060ea4

facebook-github-bot · 2020-01-10T01:54:00Z

This pull request was exported from Phabricator. Differential Revision: D18772594

facebook-github-bot · 2020-01-12T00:37:21Z

This pull request was exported from Phabricator. Differential Revision: D18772594

Summary: Pull Request resolved: pytorch#681 Pull Request resolved: facebookresearch/fairseq#1524 Make fairseq MultiheadAttention scriptable. Looking for feedbacks. 1. Add types 2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dict. As a result modules with multihead attention in it would assign itself fairseq_instance_id in the initializer. 3. There might be opportunities to make assertions and annotations cleaner. Differential Revision: D18772594 fbshipit-source-id: 4353d522d244b1508190d33ca5be6f2299e8442c

facebook-github-bot · 2020-01-14T23:41:05Z

This pull request was exported from Phabricator. Differential Revision: D18772594

Differential Revision: D18799003 fbshipit-source-id: a7e088997e9d246f1216b1ce0e4deff43354c9a7

Summary: Pull Request resolved: pytorch/translate#681 Pull Request resolved: facebookresearch#1524 Make fairseq MultiheadAttention scriptable. Looking for feedbacks. 1. Add types 2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dict. As a result modules with multihead attention in it would assign itself fairseq_instance_id in the initializer. 3. There might be opportunities to make assertions and annotations cleaner. Differential Revision: D18772594 fbshipit-source-id: 8b8b87f0e74f4afb863b15fc4172482b640f6197

facebook-github-bot · 2020-01-16T18:10:00Z

This pull request was exported from Phabricator. Differential Revision: D18772594

Summary: Pull Request resolved: pytorch#681 Pull Request resolved: facebookresearch/fairseq#1524 Make fairseq MultiheadAttention scriptable. Looking for feedbacks. 1. Add types 2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dict. As a result modules with multihead attention in it would assign itself fairseq_instance_id in the initializer. 3. There might be opportunities to make assertions and annotations cleaner. Differential Revision: D18772594 fbshipit-source-id: 5c21d7d84db1320201f486015bb91469006ffd95

Summary: Pull Request resolved: fairinternal/fairseq-py#1002 Pull Request resolved: pytorch/translate#681 Pull Request resolved: #1524 Make fairseq MultiheadAttention scriptable. Looking for feedbacks. 1. Add types 2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dict. As a result modules with multihead attention in it would assign itself fairseq_instance_id in the initializer. 3. There might be opportunities to make assertions and annotations cleaner. Reviewed By: myleott Differential Revision: D18772594 fbshipit-source-id: 377aef4bbb7ef51da5b6bac9a87a6f7b03b16fe1

Summary: Pull Request resolved: fairinternal/fairseq-py#1002 Pull Request resolved: pytorch/translate#681 Pull Request resolved: facebookresearch#1524 Make fairseq MultiheadAttention scriptable. Looking for feedbacks. 1. Add types 2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dict. As a result modules with multihead attention in it would assign itself fairseq_instance_id in the initializer. 3. There might be opportunities to make assertions and annotations cleaner. Reviewed By: myleott Differential Revision: D18772594 fbshipit-source-id: 377aef4bbb7ef51da5b6bac9a87a6f7b03b16fe1

Summary: Pull Request resolved: fairinternal/fairseq-py#1002 Pull Request resolved: pytorch/translate#681 Pull Request resolved: facebookresearch/fairseq#1524 Make fairseq MultiheadAttention scriptable. Looking for feedbacks. 1. Add types 2. Move incremental state management logic from util functions to initializers. TorchScript in general doesn't support global dict. As a result modules with multihead attention in it would assign itself fairseq_instance_id in the initializer. 3. There might be opportunities to make assertions and annotations cleaner. Reviewed By: myleott Differential Revision: D18772594 fbshipit-source-id: 377aef4bbb7ef51da5b6bac9a87a6f7b03b16fe1

facebook-github-bot added the CLA Signed label Dec 19, 2019

cndn force-pushed the export-D18772594 branch from dd49d1c to 8517451 Compare January 10, 2020 01:53

cndn mentioned this pull request Jan 10, 2020

Script MultiheadAttention (#1524) pytorch/translate#681

Closed

cndn force-pushed the export-D18772594 branch from 8517451 to 39bb962 Compare January 12, 2020 00:37

cndn force-pushed the export-D18772594 branch from 39bb962 to f9153b3 Compare January 14, 2020 23:41

cndn added 2 commits January 16, 2020 10:09

formatting multihead_attention.py

8cd7c23

Differential Revision: D18799003 fbshipit-source-id: a7e088997e9d246f1216b1ce0e4deff43354c9a7

cndn force-pushed the export-D18772594 branch from f9153b3 to 83bb2d2 Compare January 16, 2020 18:09

facebook-github-bot closed this in pytorch/translate@23cedf7 Jan 22, 2020

facebook-github-bot added the Merged label Jan 22, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Script MultiheadAttention #1524

Script MultiheadAttention #1524

cndn commented Dec 19, 2019

facebook-github-bot commented Dec 19, 2019

facebook-github-bot commented Jan 10, 2020

facebook-github-bot commented Jan 12, 2020

facebook-github-bot commented Jan 14, 2020

facebook-github-bot commented Jan 16, 2020

Script MultiheadAttention #1524

Script MultiheadAttention #1524

Conversation

cndn commented Dec 19, 2019

facebook-github-bot commented Dec 19, 2019

facebook-github-bot commented Jan 10, 2020

facebook-github-bot commented Jan 12, 2020

facebook-github-bot commented Jan 14, 2020

facebook-github-bot commented Jan 16, 2020