[examples] better results on wenetspeech using revised transcripts #2371
Updated Whisper results:
As you can see, the overall results are already very close to those of Paraformer. I believe that by adding more open-source data, Whisper can definitely achieve better performance than Paraformer.
Are there any plans to release this trained model?
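The results do not yet fully surpass Paraformer, so there is little point in releasing it for now. Also, I have heard that Alibaba intends to release a better multilingual model; fine-tuning on that stronger base model later may work better.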
@Ryuk17 Also, training 200M-scale models in wenet is already quite efficient: with 8 3090 GPUs and DeepSpeed, the u2pp_conformer results above can be reproduced in 3~4 days.
Understood, thanks.
I have uploaded a Whisper model. Link: https://pan.baidu.com/s/1R1mJyqc_MwKZy6Vn32Z3zA?pwd=171n
Can this fine-tuned model be converted back to an OpenAI-style ckpt?
See wenet-e2e/WenetSpeech#54 for more information.