fix(avatar): preserve audio wrappers across avatar hot-swaps by longcw · Pull Request #5863 · livekit/agents

longcw · 2026-05-27T07:27:15Z

Summary

Avatar plugins set session.output.audio = DataStreamAudioOutput(...) on every start. On the first start this works because AgentSession.start() wraps the sink with the TranscriptSynchronizer / RecorderAudioOutput chain afterwards; on a mid-session rebind (avatar switch) the raw assignment blows the chain away, silently breaking transcription sync and recording.

Fix it by auto-inserting an _AudioSinkProxy at the bottom of any wrapper chain. Wrappers cache the proxy, the proxy holds the swappable leaf — so hot-swaps preserve the wrappers above. New AgentOutput.swap_audio_endpoint(sink) walks the chain to the proxy and swaps its downstream in place, leaving the wrappers attached; full replacement stays as output.audio = sink.

Plugin migration

All 13 avatar plugins migrated to swap_audio_endpoint(...): anam, avatario, avatartalk, bey, bithuman, did, keyframe, lemonslice, liveavatar, runway, simli, tavus, trugen.

Example

examples/avatar_agents/audio_wave now demonstrates hot-swapping through a swap_avatar RPC method: it tears down the current avatar (removing it from the room) and launches a fresh one under the same identity, while the audio wrappers and the listeners attached to session.output.audio survive the swap.

AvatarSession.start() rebinds session.output.audio to a fresh DataStreamAudioOutput. On the first call the wrapper chain (Recorder, TranscriptSynchronizer) wraps it correctly, but a re-bind during a mid-session avatar switch overwrites the synchronizer-wrapped output with a raw sink, breaking audio/transcription sync and recording. Introduce _AudioSinkProxy, a transparent proxy auto-inserted at the bottom of any wrapper chain. Wrappers cache the proxy (not the leaf), so the leaf can be hot-swapped via the proxy without invalidating upstream references. When the proxy has no inner sink, flush() synthesizes a playback_finished so upstream wrappers don't hang. Add AgentOutput.set_audio_sink(sink, *, preserve_wrappers=False). With preserve_wrappers=True, walks the chain to find the proxy and swaps its downstream; otherwise behaves as the existing audio setter. Avatar plugins migrate to this API; AvatarSession.aclose() detaches the sink so the chain stays intact across aclose -> restart. Drops the "may be replaced by the avatar" warning in AvatarSession.start since the proxy makes mid-session rebinding correct by construction.

…ers=True) Route every avatar plugin's audio sink binding through the new AgentOutput.set_audio_sink API so mid-session hot-swaps (e.g. avatar switches) preserve the TranscriptSynchronizer / RecorderAudioOutput wrapper chain. Plugins migrated: anam, avatario, avatartalk, bey, bithuman, did, keyframe, liveavatar, runway, simli, tavus, trugen.

Covers: - auto-wrap inserts the proxy between a wrapper and a bare leaf - auto-wrap skipped when the downstream is already a proxy or a non-leaf - set_audio_sink default replaces the chain - set_audio_sink with preserve_wrappers swaps the proxy's inner in place - preserve_wrappers fallback when no proxy exists in the chain - proxy rejects a wrapper chain as inner (set_next_in_chain assert) - detached proxy synthesizes playback_finished on flush - swap routes new-leaf playback events to upstream listeners - swap disconnects the old leaf from the chain - on_attached/on_detached propagate to current inner and across swaps

Drop the leaf-only assertion in _AudioSinkProxy.set_next_in_chain — the base AudioOutput machinery cascades capture/flush and bubbles playback events through any chain, so the proxy can hold either a leaf or a wrapper chain without breaking the contract upstream.

The base class doesn't track which sink the avatar set, so nulling session.output.audio unconditionally could clobber a sink owned by someone else. The wrapper chain stays intact across hot-swaps anyway because the proxy preserves the wrappers regardless of what's in its downstream slot, so leaving the sink in place until it's replaced or the session tears down is fine.

Drop the preserve_wrappers flag: the wrapper-preserving leaf swap is now its own method, and full replacement stays as output.audio = sink.

…wrappers-on-avatar-swap

The detached no-op mode of _AudioSinkProxy synthesized a synchronous playback_finished during flush(), which re-entered _SyncedAudioOutput.on_playback_finished and caused a double rotate_segment while skipping end_audio_input. Nothing passes None in practice, so require a real sink instead of fixing the re-entrancy.

devin-ai-integration

Devin Review found 2 new potential issues.

devin-ai-integration · 2026-06-11T09:35:24Z

+    @property
+    def sample_rate(self) -> int | None:
+        if self._sample_rate is not None:
+            return self._sample_rate
+        return self.next_in_chain.sample_rate if self.next_in_chain else None


🚩 sample_rate is now dynamic instead of fixed at construction time

Both _SyncedAudioOutput (synchronizer.py:553-557) and RecorderAudioOutput (recorder_io.py:366-370) changed from setting sample_rate at construction time (passing it to super().__init__) to computing it dynamically via a property that delegates to self.next_in_chain.sample_rate. This is intentional for the hot-swap use case — after swapping the leaf sink, the sample rate should reflect the new sink's requirements. However, in generation.py:418-428, the resampler is created lazily on the first frame and never recreated. If the sample rate changes after the first frame (e.g., from a hot-swap), the resampler won't be updated. This is a pre-existing limitation, not introduced by this PR, but it becomes more relevant now that hot-swapping is supported.

Was this helpful? React with 👍 or 👎 to provide feedback.

devin-ai-integration

Devin Review found 1 new potential issue.

A swap while a flushed segment was still playing out removed the old sink's listeners before its playback_finished arrived, leaving the playback accounting unbalanced and wait_for_playout() hanging. Now the proxy clears the old sink's buffer and reports the orphaned segment as interrupted; a segment still being captured continues on the new sink, which reports it on its own.

devin-ai-integration

Devin Review found 1 new potential issue.

theomonnom · 2026-06-12T03:25:26Z

            else:
                self._audio_sink.on_detached()

+    def swap_audio_endpoint(self, sink: AudioOutput) -> None:


Suggested change

def swap_audio_endpoint(self, sink: AudioOutput) -> None:

def replace_audio_leadl(self, sink: AudioOutput) -> None:

not a fan of the name

should it be something like replace_audio_sink?

"lead" reads as the head of the chain, but the method replaces the tail, how about

replace_audio_endpoint

replace_audio_destination

redirect_audio

replace_audio_tail

longcw added 6 commits May 27, 2026 14:56

clean

4d101d5

chenghao-mou requested a review from a team May 27, 2026 07:27

This comment was marked as resolved.

Sign in to view

fix synchronizer

3e21868

This comment was marked as resolved.

Sign in to view

fix RecorderAudioOutput sample rate

b3c3dd3

theomonnom reviewed May 27, 2026

View reviewed changes

Comment thread livekit-agents/livekit/agents/voice/io.py Outdated

longcw requested a review from theomonnom June 3, 2026 00:54

longcw added 3 commits June 3, 2026 21:36

rename set_audio_sink to swap_audio_endpoint

5296871

Drop the preserve_wrappers flag: the wrapper-preserving leaf swap is now its own method, and full replacement stays as output.audio = sink.

example(avatar): hot-swap the avatar via an RPC method

edf8965

Merge remote-tracking branch 'origin/main' into longc/preserve-audio-…

f676d3e

…wrappers-on-avatar-swap

This comment was marked as resolved.

Sign in to view

longcw mentioned this pull request Jun 9, 2026

VideoAvatar handling does not allow per-agent session customization #4198

Open

devin-ai-integration Bot reviewed Jun 11, 2026

View reviewed changes

longcw force-pushed the longc/preserve-audio-wrappers-on-avatar-swap branch from 2900ae1 to b62cd5f Compare June 11, 2026 12:12

devin-ai-integration Bot reviewed Jun 11, 2026

View reviewed changes

Comment thread livekit-agents/livekit/agents/voice/io.py

theomonnom reviewed Jun 12, 2026

View reviewed changes

theomonnom approved these changes Jun 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(avatar): preserve audio wrappers across avatar hot-swaps#5863

fix(avatar): preserve audio wrappers across avatar hot-swaps#5863
longcw wants to merge 13 commits into
mainfrom
longc/preserve-audio-wrappers-on-avatar-swap

longcw commented May 27, 2026 •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

Uh oh!

devin-ai-integration Bot Jun 11, 2026

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

devin-ai-integration Bot left a comment

Uh oh!

Uh oh!

theomonnom Jun 12, 2026

Uh oh!

davidzhao Jun 12, 2026

Uh oh!

longcw Jun 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	def swap_audio_endpoint(self, sink: AudioOutput) -> None:
	def replace_audio_leadl(self, sink: AudioOutput) -> None:

Conversation

longcw commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Plugin migration

Example

Related

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

devin-ai-integration Bot Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

theomonnom Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

davidzhao Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

longcw Jun 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

longcw commented May 27, 2026 •

edited

Loading