-
Notifications
You must be signed in to change notification settings - Fork 221
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
espnet example? #171
Comments
Yes, this is very cool. |
Yeah, I have a Kaldi NSC recipe here: https://github.com/pzelasko/kaldi/tree/feature/nsc-recipe/egs/nsc/s5 However, it is not well-tuned - I simply replicated the LibriSpeech recipe as it has a similar amount of data, and got about 31% WER on a held-out, unofficial test set (I don't think there is an official one). |
I just made an NSC recipe by using the lhotse data preparation in espnet espnet/espnet#2846 |
Cool! Going forward, would you be interested to explore using Lhotse's Python API? You can see an example in snowfall's librispeech recipe here and then K2 training uses a dedicated Dataset class here (I could help with writing the Dataset class, I'm actually trying to figure out what is the right way to make a generic template or a set of utilities for that to support many tasks) As Dan mentioned that'd definitely help us add and tune relevant features like data augmentation, etc. |
Resolved by #384 (there is a colab notebook link with an example of Lhotse + ESPnet integration) |
I have been following espnet https://github.com/espnet/espnet and they are open to new recipes, they have merged a couple in the last week or so. I think lhotse+espnet would be a good combination because Shinji's real interest is, I think, in end-to-end architectures and abstracting away the data-prep may appeal to him.
Would also be a good opportunity to tune things like data augmentation recipes.
The text was updated successfully, but these errors were encountered: