This makefile works everywhere and "dep-test" will run and do an audio to text...
Text to Text might also be useful as another "extension".
Just raising to see if there is support for integration or not.
gedw99 changed the title from "speak to text , and Text to Text for language trans" to "speak to text , and Text to Text for language trans "extensions"" on Oct 29, 2024
Hello! This is an interesting idea. I can't speak to whether adding extensions would be feasible, or whether that is something Sean would like to add. However, I do have reservations about adding speech recognition. I use and rely on STT and have firsthand experience of how inaccurate the transcription can be. I don't believe it would be accurate enough to be useful; in my experience the transcriptions often require heavy editing to match what was spoken.
With that said, if it were going to be implemented, I think it would be a good idea to add warnings that the text may be inaccurate or misleading. This is important because a user who relies exclusively on the text to understand what is going on would have no way to verify its accuracy.
I am optimistic about the technology; it has gotten significantly better recently. I would be very interested to hear other people's opinions about it.
Whisper can be wrapped with Go easily, and then the system can do speech to text.
Working demo here: https://github.com/gedw99/galene-stt (NOT integrated with broadcast-box yet).
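For anyone wondering what "wrapping Whisper with Go" could look like, here is a minimal sketch. It is not taken from the galene-stt demo. It assumes a whisper.cpp command-line binary is installed and shells out to it with `os/exec`, using that CLI's `-m` (model file) and `-f` (input WAV file) flags. The function names `whisperArgs` and `transcribe`, and any paths you pass in, are hypothetical.

```go
package main

import (
	"bytes"
	"context"
	"fmt"
	"os/exec"
	"strings"
)

// whisperArgs builds the argument list for a whisper.cpp command-line binary.
// -m selects the model file and -f the input WAV file.
func whisperArgs(modelPath, wavPath string) []string {
	return []string{"-m", modelPath, "-f", wavPath}
}

// transcribe runs the binary at binPath (an assumed install location) and
// returns whatever it printed to stdout, with surrounding whitespace trimmed.
// The output format is left untouched; any timestamps the CLI prints stay in.
func transcribe(ctx context.Context, binPath, modelPath, wavPath string) (string, error) {
	cmd := exec.CommandContext(ctx, binPath, whisperArgs(modelPath, wavPath)...)

	var stdout, stderr bytes.Buffer
	cmd.Stdout = &stdout
	cmd.Stderr = &stderr

	if err := cmd.Run(); err != nil {
		// Include stderr so a missing model or unreadable audio file is visible.
		return "", fmt.Errorf("whisper failed: %w: %s", err, strings.TrimSpace(stderr.String()))
	}
	return strings.TrimSpace(stdout.String()), nil
}
```

A caller would pass a context with a timeout plus the paths to its own binary, model, and audio file, then attach an accuracy warning to the returned text before showing it to viewers. Shelling out keeps the sketch free of cgo; a real integration might prefer bindings or a streaming pipeline instead.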