-
Notifications
You must be signed in to change notification settings - Fork 110
[AI-271] Elevenhour labs Scribe2 #170
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Caution Review failedThe pull request is closed. WalkthroughThis pull request introduces ElevenLabs Scribe v2 real-time STT (Speech-to-Text) integration for Vision Agents. It adds a new STT plugin implementation with WebSocket-based audio streaming, a comprehensive test suite, example code demonstrating integration with GetStream and Gemini LLM, updated project metadata, and a participant fixture for testing. Changes
Sequence DiagramsequenceDiagram
actor User
participant Vision_Agent
participant STT as ElevenLabs STT
participant WebSocket as ElevenLabs WebSocket
participant AudioQueue as Audio Queue
participant Transcript_Emitter as Event Emitter
User->>Vision_Agent: send PCM audio
Vision_Agent->>STT: process_audio(pcm_data, participant)
STT->>STT: resample to 16kHz mono if needed
STT->>AudioQueue: enqueue audio
par Continuous Processing
STT->>STT: _send_audio_loop()
STT->>AudioQueue: dequeue batch
STT->>STT: base64 encode
STT->>WebSocket: send audio bytes
and WebSocket Listening
WebSocket-->>STT: on_partial_transcript event
STT->>Transcript_Emitter: emit partial transcript
Transcript_Emitter-->>Vision_Agent: transcript event
WebSocket-->>STT: on_committed_transcript event
STT->>Transcript_Emitter: emit final transcript
Transcript_Emitter-->>Vision_Agent: transcript event
end
alt Connection Error
WebSocket-->>STT: on_error event
STT->>STT: _attempt_reconnect (exponential backoff)
STT->>WebSocket: re-establish connection
end
STT-->>Vision_Agent: transcripts + metadata (confidence, language)
Estimated code review effort🎯 4 (Complex) | ⏱️ ~45 minutes Key areas requiring careful attention:
Possibly related PRs
Suggested labels
Suggested reviewers
Poem
✨ Finishing touches
🧪 Generate unit tests (beta)
📜 Recent review detailsConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Pro Disabled knowledge base sources:
⛔ Files ignored due to path filters (1)
📒 Files selected for processing (12)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
Summary by CodeRabbit
Release Notes
New Features
Documentation
Other