Support streaming tool output and deduplication by timvisher-dd · Pull Request #7 · timvisher-dd/agent-shell-plus

timvisher-dd · 2026-03-15T20:53:37Z

Checklist

I agree to communicate (PR description and comments) with the author myself (not AI-generated).
I've reviewed all code in PR myself and will vouch for its quality.
I've read and followed the Contributing guidelines.
I've filed a feature request/discussion for a new feature.
I've added tests where applicable.
I've run M-x checkdoc and M-x byte-compile-file.

Problem

The two most popular agent ACPs, codex-acp and claude-agent-acp, perform very poorly in agent-shell when tool executions emit a lot of text:

codex-acp: O(n²) rendering and massive data transfer. A 35k-line bash command takes ~60s and transfers ~890 MB of JSON — a 3,000× amplification of ~280 KB actual output. Each tool_call_update carries the full accumulated output; agent-shell replaces the entire fragment body and reruns markdown-overlays-put on every update.
claude-agent-acp: output is silently lost. The same command truncates to 241 of 35,001 lines. The user sees raw <persisted-output> XML tags rendered verbatim in the shell buffer.

Cause

agent-shell does not advertise _meta.terminal_output in clientCapabilities during the ACP initialize handshake. Without this capability:

codex-acp falls back to sending the full accumulated output in every tool_call_update (O(n²) content growth).
claude-agent-acp sends a single truncated result at completion instead of streaming the full output.

Fix

Advertise _meta.terminal_output during initialize and handle the resulting streaming behavior:

Extend acp.el to accept :terminal-capability and :meta-capabilities on acp-make-initialize-request.
Pass those capabilities from agent-shell.el during initialize.
Handle incremental _meta.terminal_output.data chunks (codex-acp) and batch _meta.terminal_output results (claude-agent-acp) in a new streaming handler with deduplication.
Strip <persisted-output> tags and render previews cleanly.

Implementation

New files

agent-shell-meta.el — extractors for ACP _meta payloads:

agent-shell--meta-lookup — key lookup handling both symbol and string keys in alists.
agent-shell--meta-find-tool-response — walks any _meta namespace to find a toolResponse value.
agent-shell--tool-call-meta-response-text — extracts stdout text from _meta.*.toolResponse in its various shapes (string, alist with stdout key, vector of content blocks).
agent-shell--tool-call-terminal-output-data — extracts _meta.terminal_output.data.

agent-shell-streaming.el — streaming tool call update handler:

agent-shell--tool-call-normalize-output — strips markdown fences, strips <persisted-output> XML tags (rendering the preview with font-lock-comment-face), and ensures trailing newlines.
agent-shell--append-tool-call-output — accumulates streamed output in the state's :tool-calls hash under an :accumulated key per tool call ID.
agent-shell--handle-tool-call-update-streaming — the main handler, replacing the inline tool_call_update block in agent-shell.el. Three branches:
1. Terminal data (_meta.terminal_output.data): normalize the chunk, accumulate it, and immediately append it to the fragment body for live streaming.
2. Meta response (_meta.*.toolResponse): normalize and accumulate silently (rendered only on final update to avoid duplication).
3. Final update (status is "completed" or "failed"): render accumulated output (or fall back to content text), log to transcript, clean up permission dialogs, and apply title/label updates.
agent-shell--mark-tool-calls-cancelled — marks all in-progress tool calls as cancelled (called from agent-shell-interrupt).

Changes to `agent-shell.el`

(require 'agent-shell-streaming) added.
The ~50-line inline tool_call_update rendering block is replaced by a single call to agent-shell--handle-tool-call-update-streaming. The metadata save (title/description/command/raw-input/diff) remains inline before the handler call.
The initialize request now passes :terminal-capability t and :meta-capabilities '((terminal_output . t)) to acp-make-initialize-request.
agent-shell-interrupt calls agent-shell--mark-tool-calls-cancelled after sending the cancel notification.
shell-maker-define-major-mode call passes 'agent-shell-mode-map (quoted symbol) instead of the bare variable.

Tests

7 new tests in tests/agent-shell-streaming-tests.el:

agent-shell--tool-call-meta-response-text-test — extracts text from _meta.claudeCode.toolResponse.stdout.
agent-shell--tool-call-normalize-output-test — strips fences and ensures trailing newline.
agent-shell--tool-call-normalize-output-persisted-output-test — strips <persisted-output> tags.
agent-shell--tool-call-update-writes-output-test — verifies accumulated output is written to the fragment body.
agent-shell--tool-call-meta-response-no-duplication-test — meta response text is rendered once, not duplicated with content.
agent-shell-initialize-request-meta-capabilities-test — the initialize request includes _meta.terminal_output.
agent-shell--tool-call-terminal-output-data-streaming-test — codex-style _meta.terminal_output.data chunks are accumulated and rendered incrementally.

Perf measurements

Test: for x in {0..35000}; do printf 'line %d\n' "$x"; done (35,001 lines)

codex-acp

	measure_ms (avg)	content_bytes (avg)	terminal_bytes (avg)
Without terminal caps	~60,000	~900,000,000	0
With terminal caps	~7,500	~3,000	~240,000

~8× faster. Content drops from ~900 MB to ~3 KB.

claude-agent-acp

	measure_ms (avg)	content_bytes	terminal_bytes
Without terminal caps	~22,000	2,321 (truncated to 241 lines)	0
With terminal caps	~23,000	0	2,270

No timing improvement (execution is server-side), but <persisted-output> tags are handled cleanly.

Prerequisite: acp.el changes

acp.el needs to accept :terminal-capability and :meta-capabilities keyword arguments on acp-make-initialize-request. See xenodium/acp.el#15.

Point .claude, .codex, .gemini directories to .agents and their respective markdown files to AGENTS.md so all IDE agents share a single source of truth. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

CI runs four jobs: byte-compilation + ERT tests, agent config symlink verification, dependency DAG cycle detection, and README update check. bin/test drives all checks locally by parsing ci.yml with yq so the two stay in sync automatically. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…ning - Quote agent-shell-mode-map symbol in shell-maker-define-major-mode (macros expect the symbol name, not the variable value) - Fix decorator tests: use setq-local inside let instead of let* shadowing the buffer-local, so buffer-local-value finds the binding - Suppress message output in copy-session-id test - Add forward declaration for agent-shell-text-file-capabilities in devcontainer module to silence byte-compiler Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

New module for checking buffer-level invariants during agent-shell operation: process-mark ordering, fragment ordering per namespace, ui-state property contiguity, and content-store consistency. Includes a per-buffer event ring for tracing, rate-limited violation reporting with debug bundle capture, and comprehensive ERT tests. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

New utility module for walking nested _meta namespaces in ACP tool call updates. Handles multiple agent response shapes (stdout string, content string, vector of text items) and provides clean accessors for toolResponse text and streaming terminal output data. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

New module for incremental tool call output rendering. Accumulates chunks from _meta.*.toolResponse and _meta.terminal_output, strips backtick fences and <persisted-output> tags, and appends deltas in-place to avoid O(n²) full-block rebuilds during streaming. Includes per-tool-call output markers, UI state caching, and comprehensive tests covering codex-acp terminal output, claude-agent batch results, and error handling. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Wire the new streaming, meta, and invariants libraries into the core agent-shell and UI layers: - Advertise _meta.terminal_output capability in session/new - Extract tool-call update handler into agent-shell-streaming - Deduplicate thought chunks via accumulated-delta tracking - Add insert-cursor (marker with insertion-type t) so fragments appear in creation order above the prompt - Preserve process-mark across fragment updates so context insertion and prompt position remain stable - Debounce markdown overlay application during streaming appends to avoid O(n²) re-parsing - In-place body append in agent-shell-ui to avoid delete-and-reinsert that displaces point - Fix context insertion: goto-char insert-start so point lands at the prompt after inserting context - Add context insertion regression tests Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Add live-validate command documentation for verifying rendering pipeline changes with a live batch session - Update AGENTS.md development workflow with live-validate step - Update README.org features list: expand CI sub-features, add streaming dedup, DWIM context insertion, and runtime invariants Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…notification

…ADME

Signed-off-by: Rainer Poisel <rainer@embedded-focus.com>

The help functions require transient at compile time for the =transient-define-prefix= macro and the transient definitions need to be at the top level to satisfy the byte compiler.

This change replaces the use of =/tmp= (~temporary-file-directory~) for caching with the user's cache directory as given by the XDG Base Directory specification or a system-dependent fallback. This avoids permission conflicts when multiple users use agent-shell on a shared system.

Also document Codex non-standard use

If I see it correctly, pngpaste is for MacOS and xclip handles Xorg desktops. This adds ~wl-paste~ for Wayland environments.

`agent-shell-restart` kills the buffer and calls `agent-shell--start` directly after. Since `kill-buffer` will have the effect of changing the default directory to the "other buffer", `agent-shell--start` can possibly end up creating the new shell an unexpected project (e.g if the user's last buffer was in a different project, the new shell will be made for this project instead of the expected behavior of being for the same project the original shell was for). The fix is to bind `default-directory` around calling `agent-shell--start`, since this will retain the default directory across the `kill-buffer` call. `agent-shell-reload` was also affected, since it delegates to `agent-shell-restart`. A new test is added to ensure that, in a frame whose last buffer was another project than the current shell's, calling restart will still use the shell's project as expected. Without the fix, the test fails. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

`font-get :size` returns 0 when the default font is configured via `:height` (1/10pt units) rather than an explicit pixel size. This is common on macOS (e.g., Menlo at `:height 120`). The 0 propagates into the SVG `<text>` elements as `font-size="0"`, rendering all header text invisible while the icon still displays. Fall back to `frame-char-height` when `font-get :size` returns 0. Fixes xenodium#462

The old call to ~call-process~ would merge stdout and stderr into a single stream that would always create ~file-path~ even if no image was in the clipboard. This could then lead downstream code to erroneously assume that an image had been found even though an error was signaled.

Fix xenodium#467

Fixes xenodium#465 When loading a session with `agent-shell-prefer-session-resume` set to nil, `user_message_chunk` notifications replay the conversation history. Non-text content (e.g. images) lacks the `text` field, causing `(wrong-type-argument stringp nil)` when passed to `propertize`. Fall back to a `[type]` placeholder (e.g. `[image]`) when `text` is nil.

Skip make-frame in noninteractive mode; buffer-list ordering achieves the same fallback behavior without needing a terminal frame. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

The rebase introduced an extra closing paren in the agent-shell--permission-title-execute-fenced-test, causing a read syntax error. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

timvisher-dd force-pushed the streaming-dedup branch 2 times, most recently from 3415d07 to 9101647 Compare March 16, 2026 14:26

timvisher-dd changed the title ~~# Support streaming tool output and deduplication~~ Support streaming tool output and deduplication Mar 16, 2026

timvisher-dd force-pushed the streaming-dedup branch 6 times, most recently from df9a77f to 68b7774 Compare March 19, 2026 15:07

timvisher-dd and others added 8 commits March 20, 2026 10:08

Add agent config symlinks for multi-IDE support

a45a08b

Point .claude, .codex, .gemini directories to .agents and their respective markdown files to AGENTS.md so all IDE agents share a single source of truth. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

timvisher-dd force-pushed the streaming-dedup branch from 62b5363 to cb357ac Compare March 20, 2026 14:20

xenodium added 12 commits March 31, 2026 10:31

Update Claude Code icon

8635cb9

Fixing auth-source-pass-get usage in README xenodium#434

e4d65f4

Try to include file name in permission title if missing xenodium#415

4e1bc96

Adds agent-shell-mock-agent.el (needs mock-acp binary installed)

314b2b4

Adding experimental incoming session/pushPrompt

a0044ec

Disable image pasting when running in tui xenodium#435

0710b49

Renaming experimental session/pushPrompt to session/push

4e6ddad

Adding more to CONTRIBUTING.org

a24ebb9

Reject session/push requests when busy

4892364

Show activity indicator while receiving push and ignore out of bound …

3954508

…notification

Adding https://github.com/zackattackz/agent-shell-notifications to RE…

1500196

…ADME

Adds agent-shell-new-downloads-shell and agent-shell-new-temp-shell

6f3ed77

rpoisel and others added 28 commits March 31, 2026 10:32

Add related project agent-circus to README.org

307dce6

Signed-off-by: Rainer Poisel <rainer@embedded-focus.com>

Make id more evident for available models and modes xenodium#452

28f1fb7

Ensure that viewport compiles

0d639cc

The help functions require transient at compile time for the =transient-define-prefix= macro and the transient definitions need to be at the top level to satisfy the byte compiler.

Use project-name instead of default-directory in header

184b4c7

Fix for structured input from toolCall.rawInput.plan

a334503

Make agent-shell--format-plan more forgiving xenodium#438

7c40d53

Also document Codex non-standard use

Enable expanding region/context text for editing xenodium#459

12023b9

Fixes xenodium#455: unhandled method returns an error, unblocking client

2fbe9f7

Text header/modeline improvements xenodium#448

0093e35

Fixing checkdoc warning

25a6178

Ensure button border does not leak into subsequently inserted text

313c0bb

Add wl-paste as a Wayland image handler

b94cd0c

If I see it correctly, pngpaste is for MacOS and xclip handles Xorg desktops. This adds ~wl-paste~ for Wayland environments.

Add documentation about the AOuth Anthropic authentication

b13a9e8

Replacing or + when-let* with if-let* xenodium#463

8fa5b08

Fixes refocus after diff regression xenodium#466

77e747b

Adding README entry for slash commands

6256877

Shortening some agent names

3250af0

Display key bindings in menu tooltips xenodium#448

f6150b6

Adding agent-shell-bookmark to related packages section

f600ce8

Caching project files completions for improved performance

ed5d26a

Fix xenodium#467

Fall back to "unknown" when type is not known xenodium#477

1af38c5

Fix restart test to work in batch mode

bcf36bb

Skip make-frame in noninteractive mode; buffer-list ordering achieves the same fallback behavior without needing a terminal frame. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Fix extra closing paren in permission-title execute test

95c8c17

The rebase introduced an extra closing paren in the agent-shell--permission-title-execute-fenced-test, causing a read syntax error. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

timvisher-dd force-pushed the streaming-dedup branch from a2a672e to 95c8c17 Compare March 31, 2026 14:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support streaming tool output and deduplication#7

Support streaming tool output and deduplication#7
timvisher-dd wants to merge 48 commits intomainfrom
streaming-dedup

timvisher-dd commented Mar 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Conversation

timvisher-dd commented Mar 15, 2026

Checklist

Problem

Cause

Fix

Implementation

New files

Changes to agent-shell.el

Tests

Perf measurements

codex-acp

claude-agent-acp

Prerequisite: acp.el changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Changes to `agent-shell.el`