Temporarily remove the attention model and fix pytorch_struct model. #558
Conversation
Disable JIT because it currently raises an exception.
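The exception itself is not reproduced here. As a rough illustration only, below is a minimal sketch of how a benchmark wrapper might gate JIT off while scripting is broken; the class name, constructor arguments, and the stand-in module are assumptions, not the PR's actual code.

```python
import torch

class Model:
    """Hypothetical benchmark wrapper; gating JIT behind a flag is the only point here."""

    def __init__(self, device="cpu", jit=False):
        self.device = device
        # Stand-in for the real pytorch_struct model.
        self.module = torch.nn.Linear(8, 8).to(device)
        if jit:
            # Scripting the real model currently raises an exception, so refuse the
            # JIT configuration explicitly instead of silently running in eager mode.
            raise NotImplementedError("JIT is temporarily disabled for this model")
```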
Got to make it work.
pytorch_struct's GPU utilization isn't great with all those tiny kernel dispatches; it might be a good place to try CUDA graphs.
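As a rough illustration of that suggestion (not code from this PR), the sketch below captures one forward pass into a CUDA graph with torch.cuda.graph and replays it; the linear layer and tensor shapes are placeholders for the real pytorch_struct step.

```python
import torch

assert torch.cuda.is_available()

# Placeholder model and static buffers; a real capture would wrap one pytorch_struct step.
model = torch.nn.Linear(64, 64).cuda()
static_input = torch.randn(128, 64, device="cuda")

# Warm up on a side stream before capture, as the CUDA graphs docs recommend.
s = torch.cuda.Stream()
s.wait_stream(torch.cuda.current_stream())
with torch.cuda.stream(s):
    for _ in range(3):
        model(static_input)
torch.cuda.current_stream().wait_stream(s)

# Capture: the small kernel launches of one iteration are recorded into a single graph.
g = torch.cuda.CUDAGraph()
with torch.cuda.graph(g):
    static_output = model(static_input)

# Replay: copy new data into the static buffer and relaunch the whole graph at once,
# which is where the launch-overhead savings for many tiny dispatches come from.
static_input.copy_(torch.randn(128, 64, device="cuda"))
g.replay()
```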
@xuzhao9 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Thanks for fixing this
torchtext is removing the legacy dataset utilities (pytorch/text#1437), so we either need to migrate to the new dataset API or keep the old API and copy the related code here. This PR still uses the old API because migrating to the new API seems non-trivial.
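For context only, here is a small, self-contained sketch of the kind of code the new-style API requires: vocabulary construction and padding become the caller's job instead of being handled by the legacy Field/BucketIterator classes. The toy corpus and tokenization are assumptions, not the benchmark's actual data pipeline.

```python
import torch
from torch.nn.utils.rnn import pad_sequence
from torch.utils.data import DataLoader
from torchtext.vocab import build_vocab_from_iterator

# Toy corpus standing in for the real dataset.
raw_train = ["the dog barks", "a cat sleeps on the mat"]

def yield_tokens(lines):
    # Tokenization is now explicit; the legacy Field used to do this internally.
    for line in lines:
        yield line.split()

vocab = build_vocab_from_iterator(yield_tokens(raw_train), specials=["<unk>", "<pad>"])
vocab.set_default_index(vocab["<unk>"])

def collate(batch):
    # Numericalize and pad each batch by hand, replacing BucketIterator.
    ids = [torch.tensor(vocab(line.split()), dtype=torch.long) for line in batch]
    return pad_sequence(ids, batch_first=True, padding_value=vocab["<pad>"])

loader = DataLoader(raw_train, batch_size=2, collate_fn=collate)
batch = next(iter(loader))  # LongTensor of shape (batch, max_len)
```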
I will re-add the attention model in a follow-up PR (and do the quality analysis there).
Also, the pytorch_struct model runs an unsupervised learning task, so it does not support the eval test.
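A minimal, hypothetical sketch of what "does not support the eval test" might look like at the wrapper level; the class shape and method names are assumptions about the benchmark harness, not its actual code.

```python
class Model:
    def train(self, niter=1):
        for _ in range(niter):
            pass  # one unsupervised training step would go here

    def eval(self, niter=1):
        # There is no separate supervised inference path for this task, so the
        # eval benchmark is rejected explicitly rather than silently skipped.
        raise NotImplementedError("pytorch_struct is unsupervised; eval is not supported")
```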
Batch size analysis
Non-idleness analysis (train, bs=128)
The GPU is mostly idle at bs=32, so I am testing with bs=128 instead.
Data is already prefetched to the device.
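A minimal sketch of the prefetching idea, with random tensors standing in for the real torchtext batches; the names, shapes, and the dummy training step are assumptions.

```python
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
batch_size = 128  # bs=128 keeps the GPU busier than bs=32, per the analysis above

# Prefetch: build every batch and move it to the device before timing starts, so the
# measured train loop sees no host-to-device copies.
batches = [torch.randint(0, 1000, (batch_size, 20), device=device) for _ in range(10)]

def train(niter=1):
    for _ in range(niter):
        for words in batches:
            _ = words.float().mean()  # placeholder for one real training step
```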