
[Wav2Vec2] Improve SpecAugment function by converting numpy based function to pytorch based function #10494

Closed · wants to merge 1 commit

Conversation

@punitvara commented Mar 3, 2021


Implements #10459

What does this PR do?

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

@punitvara force-pushed the feature branch 2 times, most recently from 53a39ae to 0af1ec9 on March 3, 2021 at 06:07.
```python
mask_idc = torch.randperm(sz - min_len)[:num_mask]
mask_idc = torch.from_numpy(
    np.asarray([mask_idc[j] + offset for j in range(len(mask_idc)) for offset in range(lengths[j])])
)
mask_idcs.append(np.unique(mask_idc[mask_idc < sz]))
```
Contributor:

Let's try not to use np here. If we use torch.from_numpy(...), it's not GPU-friendly.

Author:

```python
mask_idc = torch.tensor([mask_idc[j] + offset for j in range(len(mask_idc)) for offset in range(lengths[j])])
```

I just did this. Is this right?
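For reference, the expansion above can be written entirely in torch, so the indices stay on the tensor's device; a minimal sketch, with illustrative values for `mask_idc` (one start index per mask span) and `lengths` (the span lengths):

```python
import torch

# Illustrative values; in the PR these come from the sampling step above.
mask_idc = torch.tensor([3, 10, 25])
lengths = torch.tensor([4, 2, 3])

# Repeat each start index lengths[j] times, then add the in-span offsets
# 0..lengths[j]-1, mirroring the nested list comprehension above.
starts = torch.repeat_interleave(mask_idc, lengths)
offsets = torch.arange(int(lengths.max())).expand(len(lengths), -1)
expanded = starts + offsets[offsets < lengths.unsqueeze(1)]
# expanded == tensor([ 3,  4,  5,  6, 10, 11, 25, 26, 27])
```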


```diff
- min_len = min([len(m) for m in mask_idcs])
+ min_len = torch.min(mask_idcs)
  for i, mask_idc in enumerate(mask_idcs):
```
Contributor:

Let's try to get rid of the for-loop and do tensor operations only

Author:

I am not sure how to do this. Something like the following, but I'm not sure how to put it together. Can you help here?

```python
mask[i, mask_idc] = [True, torch.randperm(mask_idc)[:min_len] if torch.tensor(mask_idcs).size() > min_len]
```
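For reference, one loop-free shape for this (a sketch, not code from the PR; it assumes the rows of candidate indices have equal length, so unequal rows would need padding first): sample `min_len` indices per row with a rand-plus-argsort trick, then set them all with a single `scatter_`. All names and sizes below are illustrative.

```python
import torch

# Illustrative sizes; `candidates` stands in for the stacked mask index rows.
batch_size, seq_len, n_candidates, min_len = 4, 50, 8, 5
candidates = torch.randint(0, seq_len, (batch_size, n_candidates))

# Random scores + argsort yield a random permutation per row; keeping the
# first min_len columns samples without replacement, like a per-row randperm.
keep = torch.rand(batch_size, n_candidates).argsort(dim=1)[:, :min_len]
idx = candidates.gather(1, keep)  # (batch_size, min_len)

mask = torch.zeros(batch_size, seq_len, dtype=torch.bool)
# One scatter_ replaces the per-row loop: mask[i, idx[i, j]] = True for all i, j.
mask.scatter_(1, idx, True)
```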

@patrickvonplaten (Contributor):

We need to run benchmark tests to see how much the speed improves, on both CPU and GPU.
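A minimal harness for this could use `torch.utils.benchmark`, which handles warmup and CUDA synchronization; a sketch, assuming hypothetical drop-in variants `compute_mask_indices_np` and `compute_mask_indices_pt` (these names are not from the PR) that take the same arguments:

```python
import torch.utils.benchmark as benchmark

def time_fn(fn):
    # Time fn with fixed SpecAugment-style arguments; blocked_autorange
    # picks the number of runs and synchronizes CUDA if fn touches the GPU.
    t = benchmark.Timer(
        stmt="fn(shape, mask_prob, mask_length)",
        globals={"fn": fn, "shape": (8, 10_000), "mask_prob": 0.05, "mask_length": 10},
    )
    return t.blocked_autorange()

# print(time_fn(compute_mask_indices_np))  # numpy baseline
# print(time_fn(compute_mask_indices_pt))  # torch version, CPU or CUDA
```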

Commit: [Wav2Vec2] Improve SpecAugment function by converting numpy based function to pytorch based function

Implements huggingface#10459

fixes some style changes
@punitvara (Author):

@patrickvonplaten Can you please help with the above comments?

@patrickvonplaten (Contributor):

Hey @punitvara,

At the moment, I sadly don't have the time to handle the big chunks of this PR. It would be great if you could try to:

  1. Find a way to benchmark your new function on GPU and show that it yields a speed-up in the forward pass compared to the old function.

  2. Try out some advanced PyTorch indexing to replace the for loops.

Taking a look at these PRs should help you: #9600, #9453, #6064

@patrickvonplaten (Contributor):

Closing due to inactivity. Sorry @punitvara, I've seen a lot of interest from other people in opening a PR for this, and this one seems to have stalled. Feel free to re-open it and give it a second shot if you want :-)

@punitvara (Author):

I got busy with some other work, so I will try to work on a different issue. If you get another PR for this, feel free to merge it.
