Transcription streaming along with Audio parts responses #41

Open
mandroidV2 opened this issue Dec 26, 2024 · 1 comment
Comments

@mandroidV2

Description of the feature request:

Hi,
Is it possible to stream the transcription along with the returned audio parts? I tried setting both audio and text in the response modalities, but it didn't work.
Thanks!

What problem are you trying to solve with this feature?

I want to show a real-time transcript of the audio output.

Any other information you'd like to share?

No response

@hapticdata added the "enhancement" (New feature or request) and "model enhancement" labels and removed the "enhancement" label on Jan 2, 2025
@hapticdata
Collaborator

Hi, it is currently not possible for the model to return both text and audio tokens. If you need this functionality immediately, your best option is to receive text tokens and stream them into a separate text-to-speech system, such as Google Cloud Text-to-Speech.
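
For anyone trying the workaround above, here is a minimal sketch of the client-side glue, assuming the model is set to return text only. It groups streamed text chunks into whole sentences so that each one can be shown as a transcript line and handed to a speech synthesizer at the same time. The names `sentences_from_stream`, `speak_with_transcript`, `synthesize`, and `show_transcript` are hypothetical and not part of any SDK; `synthesize` stands in for whatever speech service you call.

```python
import re
from typing import Callable, Iterable, Iterator

# A sentence boundary is '.', '!' or '?' followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def sentences_from_stream(chunks: Iterable[str]) -> Iterator[str]:
    """Group streamed text chunks into complete sentences."""
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        parts = _SENTENCE_END.split(buffer)
        # Every part except the last one ends at a sentence boundary.
        for sentence in parts[:-1]:
            if sentence:
                yield sentence
        # The last part may still be incomplete, so keep buffering it.
        buffer = parts[-1]
    # Flush whatever is left once the stream ends.
    if buffer.strip():
        yield buffer.strip()


def speak_with_transcript(
    chunks: Iterable[str],
    synthesize: Callable[[str], None],       # hypothetical: sends text to a speech service
    show_transcript: Callable[[str], None],  # hypothetical: updates the UI
) -> None:
    """Show each sentence as transcript text, then send it for synthesis."""
    for sentence in sentences_from_stream(chunks):
        show_transcript(sentence)
        synthesize(sentence)
```

Splitting on sentence boundaries is only one possible choice; smaller units would lower latency, but synthesized speech may sound less natural.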
