-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
迷你项目:python cli(command line interface) and api(application programming interface) #2069
Comments
提一个:wenet.cpp, paraformer.cpp, ... etc (int4量化+多平台支持) |
现在paraformer 可以导出jit, 之后会支持onnx 其次 whisper 也可以考虑拿过来, espnet已经有reproduce whisper的pr了 |
目前在完善deepspeed的易用性,下一步就是reproduce whisper了 |
cli
cli-paraformer 推理基本功能完成
NOTE: 前两,ref:https://modelscope.cn/models/damo/speech_timestamp_prediction-v1-16k-offline/summary8 |
新需求: 端到端输出产生字幕 |
新需求:希望支持流式。目前cli 下的只支持音频文件,把音频文件按流式丢进去一塌糊涂 |
目的
其背景是现有的基于 c++ 的 python binding 扩展、开发、维护成本都比较高,大家普遍反馈环境问题也比较多。
功能
pip install wenet[onnx]
@xingchensong #2089欢迎大家提需求。
The text was updated successfully, but these errors were encountered: