-
Notifications
You must be signed in to change notification settings - Fork 662
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add EMFORMER_RNNT_BASE_MUSTC bundle #2222
Conversation
test/torchaudio_unittest/example/emformer_rnnt/test_mustc_lightning.py
Outdated
Show resolved
Hide resolved
@@ -1,38 +1,51 @@ | |||
#!/usr/bin/env python3 | |||
"""Train the SentencePiece model by using the transcripts of MuST-C release v2.0 training set. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This description does not seem to be correct.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This PR does a lot of things. Can you split it into smaller ones?
I see things not related to the bundle, which can be done independently.
- Change the argument from
--model_type
to--model-type
(and similar tocheckpoint_path
) - Add shebang lines to existing scripts
- Update to
eaval.py
Then there are smaller changes that are required by MUSTC model addition. - Refactor of
pipeline_demo.py
(the part without MUSTC model) - Addition of tedlium3 model test (and refactor of MockSentencePiece)
Finally, add MUST-C model. - Add MUSTC bundle.
@@ -0,0 +1,32 @@ | |||
class MockSentencePieceProcessor: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The file name for utilities cannot include test
prefix / suffix. IIRC, it causes some issue with pytest.
torchaudio.prototype.pipelines