This is exactly what I need. I want to add live captioning to VRChat.
I spent all day getting Vosk to work, but the results are poor. Whisper looks promising.
I can't wait for this 😄
EDIT: Does this fork/pull request currently work? It gives me various actions like breathing, but no words.
The whisper.cpp repo has a real-time audio input example. I think we should have a version of that in the Rust code here.
I'm on my second iteration of it, following techniques similar to those used in the C++ example code.
How do you feel about this being a part of the whisper-rs repo?
The other packages that I use are:
cpal: microphone recording
webrtc-vad: voice activity detection
ringbuf: a ring buffer
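
To make the shape of that pipeline concrete, here is a rough, std-only sketch of how the pieces could fit together. It is not the code from my branch and it does not use the real crates: `SampleRing` is a hypothetical stand-in for ringbuf, `is_voiced` is a crude energy threshold standing in for webrtc-vad, and the frames are assumed to arrive already captured (in practice they would come from a cpal input callback). All names and the threshold value are assumptions for illustration.

```rust
use std::collections::VecDeque;

/// Fixed-capacity sample buffer that drops the oldest samples when full.
/// Hypothetical stand-in for the `ringbuf` crate.
pub struct SampleRing {
    buf: VecDeque<f32>,
    capacity: usize,
}

impl SampleRing {
    pub fn new(capacity: usize) -> Self {
        Self { buf: VecDeque::with_capacity(capacity), capacity }
    }

    /// Append one frame of samples, evicting the oldest ones on overflow.
    pub fn push_frame(&mut self, frame: &[f32]) {
        if self.capacity == 0 {
            return;
        }
        for &sample in frame {
            if self.buf.len() == self.capacity {
                self.buf.pop_front();
            }
            self.buf.push_back(sample);
        }
    }

    /// Remove and return everything buffered so far, oldest first.
    pub fn drain_all(&mut self) -> Vec<f32> {
        self.buf.drain(..).collect()
    }

    pub fn is_empty(&self) -> bool {
        self.buf.is_empty()
    }
}

/// Crude voice check: root-mean-square energy above a threshold.
/// Assumption: a real build would call into webrtc-vad here instead.
pub fn is_voiced(frame: &[f32], threshold: f32) -> bool {
    if frame.is_empty() {
        return false;
    }
    let mean_sq = frame.iter().map(|s| s * s).sum::<f32>() / frame.len() as f32;
    mean_sq.sqrt() > threshold
}

/// Buffer voiced frames and emit a segment each time speech is followed
/// by a silent frame. Each returned segment is what would be handed to
/// Whisper for transcription.
pub fn segment_speech(frames: &[Vec<f32>], threshold: f32, capacity: usize) -> Vec<Vec<f32>> {
    let mut ring = SampleRing::new(capacity);
    let mut segments = Vec::new();
    for frame in frames {
        if is_voiced(frame, threshold) {
            ring.push_frame(frame);
        } else if !ring.is_empty() {
            segments.push(ring.drain_all());
        }
    }
    // Flush speech that was still in progress when the input ended.
    if !ring.is_empty() {
        segments.push(ring.drain_all());
    }
    segments
}
```

Only voiced audio goes into the buffer, so silence never reaches the model, and the buffer is flushed as one segment when the speaker pauses. A real version would probably want a short hangover period before flushing so that brief pauses do not split a sentence.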