Transcription streaming along with Audio parts responses #41

mandroidV2 · 2024-12-26T18:34:30Z

Description of the feature request:

HI
is it possible to get the transcriptions stream along with returned audio parts? I tried to set audio and text in response modalities but it didn't work.
Thanks!

What problem are you trying to solve with this feature?

I would want to show a transcript in the real time for the audio output

Any other information you'd like to share?

No response

hapticdata · 2025-01-02T16:28:25Z

Hi, it is currently not possible for the model to return both text and audio tokens. If you need this functionality immediately your best option is to receive text tokens and stream those into another speech-to-text system such as google/speech-to-text

hapticdata added enhancement New feature or request model enhancement and removed enhancement New feature or request labels Jan 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transcription streaming along with Audio parts responses #41

Transcription streaming along with Audio parts responses #41

mandroidV2 commented Dec 26, 2024

hapticdata commented Jan 2, 2025

Transcription streaming along with Audio parts responses #41

Transcription streaming along with Audio parts responses #41

Comments

mandroidV2 commented Dec 26, 2024

Description of the feature request:

What problem are you trying to solve with this feature?

Any other information you'd like to share?

hapticdata commented Jan 2, 2025