-
Notifications
You must be signed in to change notification settings - Fork 606
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refine Whisper and WeNet Contents Extractor #32
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Are there any more pip dependencies?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We need to discuss for whisper extractor. Line154 is modified by me, which is different from the original whisper code. The purpose is to avoid from downloading whisper's pretrained model automatically from the OpenAI's server, since some running servers of users (like me) are out of network.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If you put your checkpoints in the right place, i.e. model path specified in your config, I don't think it should be a problem though.
No. Since |
Modified the download_root of whisper model. If the given path is a file, its dir will be used as the download root. On the other hand, it will be used directly if it is a folder. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Use black to format this file.
modules/whisper_extractor
. Change to OpenAI's Whisper API for content features' extraction.modules/wenet_extractor
.