ensure playback_segments_count is consistent in the audio output chain #4211

longcw · 2025-12-09T10:28:54Z

When next_in_chain is set on an AudioOutput, await self.next_in_chain.capture_frame(frame) should be called before await super().capture_frame(frame), because the capture_frame of next_in_chain may be waiting for some condition. Calling it first ensures that all audio outputs in the chain have the same playback_segments_count.

This fixes an issue where an agent that sends a greeting message with session.say or session.generate_reply in Agent.on_enter (or just after the session has started) gets stuck when closing if no participant has joined the room.
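The ordering argument above can be illustrated with a toy model. This is a minimal sketch, not the livekit implementation: ToyOutput, BlockingSink, CountFirst, ForwardFirst and counts_after_cancel are hypothetical names, and the assumption that the base capture_frame increments playback_segments_count at the start of a segment is a simplification made for illustration only.

```python
import asyncio


class ToyOutput:
    """Hypothetical stand-in for an audio output node (not the livekit class)."""

    def __init__(self, next_in_chain=None):
        self.next_in_chain = next_in_chain
        self.playback_segments_count = 0
        self._capturing = False

    async def capture_frame(self, frame):
        # Assumption: the first frame of a segment bumps the segment counter.
        if not self._capturing:
            self._capturing = True
            self.playback_segments_count += 1


class BlockingSink(ToyOutput):
    """Last node in the chain; waits for a condition before accepting frames."""

    def __init__(self):
        super().__init__()
        self.ready = asyncio.Event()  # e.g. "a participant has joined"

    async def capture_frame(self, frame):
        await self.ready.wait()
        await super().capture_frame(frame)


class CountFirst(ToyOutput):
    """Old ordering: count the segment locally, then forward."""

    async def capture_frame(self, frame):
        await super().capture_frame(frame)
        await self.next_in_chain.capture_frame(frame)


class ForwardFirst(ToyOutput):
    """New ordering: forward first, count only once the next node accepted."""

    async def capture_frame(self, frame):
        await self.next_in_chain.capture_frame(frame)
        await super().capture_frame(frame)


async def counts_after_cancel(wrapper_cls):
    """Start a capture, cancel it while the sink is still waiting,
    and return (head count, sink count)."""
    sink = BlockingSink()
    head = wrapper_cls(next_in_chain=sink)
    task = asyncio.create_task(head.capture_frame(b"frame"))
    await asyncio.sleep(0)  # let the task run until it blocks on the sink
    task.cancel()
    try:
        await task
    except asyncio.CancelledError:
        pass
    return head.playback_segments_count, sink.playback_segments_count
```

In this toy model, asyncio.run(counts_after_cancel(CountFirst)) returns (1, 0): the head has counted a segment that the sink never saw. With ForwardFirst the result is (0, 0), so the counters stay in step even when the capture is cancelled while the next node is still waiting.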

chenghao-mou

LGTM!

theomonnom

lgtm! So our pattern breaks if one of the nodes in the output chain decides not to forward audio to the next node (for whatever reason)
I thought this pattern was supposed to work — ideally, we should make it work.

* main: (267 commits)
  AGT-2328: negative threshold in silero (livekit#4228)
  disable interruptions for agent greeting (livekit#4223)
  feature: GPT-5.2 support (livekit#4235)
  turn-detector: remove english model from readme (livekit#4233)
  add keep alive task for liveavatar plugin (livekit#4231)
  feat(warm-transfer): add sip_number parameter for outbound caller ID (livekit#4216)
  fix blocked send task in liveavatar plugin (livekit#4214)
  clear _q_updated right after await to avoid race conditions (livekit#4209)
  ensure playback_segments_count is consistent in the audio output chain (livekit#4211)
  fix inworld punctuation handling (livekit#4215)
  Inference: Rename fallback model name param (livekit#4202)
  fix race condition when stop background audio play handle (livekit#4197)
  fix watchfiles prevent agent prcoess exit on sigterm (livekit#4194)
  feat(google): add streaming support for Gemini TTS models (livekit#4189)
  Add LiveAvatar Stop Session API Call + README Fix (livekit#4195)
  Fallback API for Inference (livekit#4099)
  feat(rime): expand update_options to accept all TTS parameters (livekit#4095)
  mistralai models update (livekit#4156)
  fix record.exc_info is not pickable when using LogQueueHandler (livekit#4185)
  Restore otel chat message (livekit#4118)
  ...

ensure playback_segments_count is consistent in the audio output chain

7874996

longcw requested a review from a team December 9, 2025 10:29

chenghao-mou approved these changes Dec 9, 2025

theomonnom approved these changes Dec 9, 2025

longcw merged commit d933132 into main Dec 10, 2025
17 of 18 checks passed

longcw deleted the longc/capture-frame-deadlock branch December 10, 2025 00:51

longcw mentioned this pull request Dec 10, 2025

ctx.add_shutdown_callback not invoked when SIP call is unanswered/cut (v1.3.5 regression) #4152

Closed

This was referenced Dec 20, 2025

ctx.add_shutdown_callback still not invoked on unanswered/cut SIP calls in v1.3.9 (Regression from #4152) #4345

Closed

Shutdown handler not invoked when call is not received or is disconnected #4392

Closed
