xenodium · timvisher-dd · Mar 15, 2026 · Mar 14, 2026 · Mar 14, 2026 · Mar 14, 2026
diff --git a/.agents/commands/live-validate.md b/.agents/commands/live-validate.md
@@ -0,0 +1,68 @@
+# Live validation of agent-shell rendering
+
+Run a live agent-shell session in batch mode and verify the buffer output.
+This exercises the full rendering pipeline with real ACP traffic — the only
+way to catch ordering, marker, and streaming bugs that unit tests miss.
+
+## Prerequisites
+
+- `ANTHROPIC_API_KEY` must be available (via `op run` / 1Password)
+- `timvisher_emacs_agent_shell` must be on PATH
+- Dependencies (acp.el-plus, shell-maker) in sibling worktrees or
+  overridden via env vars
+
+## How to run
+
+```bash
+cd "$(git rev-parse --show-toplevel)"
+timvisher_agent_shell_checkout=. \
+  timvisher_emacs_agent_shell claude --batch \
+  1>/tmp/agent-shell-live-stdout.log \
+  2>/tmp/agent-shell-live-stderr.log
+```
+
+Stderr shows heartbeat lines every 30 seconds.  Stdout contains the
+full buffer dump once the agent turn completes.
+
+## What to check in the output
+
+1. **Fragment ordering**: tool call drawers should appear in
+   chronological order (the order the agent invoked them), not
+   reversed.  Look for `▶` lines — their sequence should match the
+   logical execution order.
+
+2. **No duplicate content**: each tool call output should appear
+   exactly once.  Watch for repeated blocks of identical text.
+
+3. **Prompt position**: the prompt line (`agent-shell>`) should
+   appear at the very end of the buffer, after all fragments.
+
+4. **Notices placement**: `[hook-trace]` and other notice lines
+   should appear in a `Notices` section, not interleaved with tool
+   call fragments.
+
+## Enabling invariant checking
+
+To run with runtime invariant assertions (catches corruption as it
+happens rather than after the fact):
+
+```elisp
+;; Add to your init or eval before the session starts:
+(setq agent-shell-invariants-enabled t)
+```
+
+When an invariant fires, a `*agent-shell invariant*` buffer pops up
+with a debug bundle and recommended analysis prompt.
+
+## Quick validation one-liner
+
+```bash
+cd "$(git rev-parse --show-toplevel)" && \
+  timvisher_agent_shell_checkout=. \
+  timvisher_emacs_agent_shell claude --batch \
+  1>/tmp/agent-shell-live.log 2>&1 && \
+  grep -n '▶' /tmp/agent-shell-live.log | head -20
+```
+
+If the `▶` lines are in logical order and the exit code is 0, the
+rendering pipeline is healthy.
diff --git a/.claude b/.claude
@@ -0,0 +1 @@
+.agents
diff --git a/.codex b/.codex
@@ -0,0 +1 @@
+.agents
diff --git a/.gemini b/.gemini
@@ -0,0 +1 @@
+.agents
diff --git a/.github/workflows/ci.yml b/.github/workflows/ci.yml
@@ -0,0 +1,176 @@
+name: CI
+
+on:
+  push:
+    branches: [main, dev]
+  pull_request:
+    branches: [main]
+
+jobs:
+  readme-updated:
+    if: github.event_name == 'pull_request'
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+        with:
+          fetch-depth: 0
+
+      - name: Check README.org updated when code changes
+        run: |
+          base="${{ github.event.pull_request.base.sha }}"
+          head="${{ github.event.pull_request.head.sha }}"
+          changed_files=$(git diff --name-only "$base" "$head")
+
+          has_code_changes=false
+          for f in $changed_files; do
+            case "$f" in
+              *.el|tests/*) has_code_changes=true; break ;;
+            esac
+          done
+
+          if "$has_code_changes"; then
+            if ! echo "$changed_files" | grep -q '^README\.org$'; then
+              echo "::error::Code or test files changed but README.org was not updated."
+              echo "Please update the soft-fork features list in README.org."
+              exit 1
+            fi
+          fi
+
+  agent-symlinks:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+
+      - name: Verify agent config symlinks
+        run: |
+          ok=true
+          for dir in .claude .codex .gemini; do
+            target=$(readlink "${dir}" 2>/dev/null)
+            if [[ "${target}" != ".agents" ]]; then
+              echo "::error::${dir} should symlink to .agents but points to '${target:-<missing>}'"
+              ok=false
+            fi
+          done
+          for md in CLAUDE.md CODEX.md GEMINI.md; do
+            target=$(readlink "${md}" 2>/dev/null)
+            if [[ "${target}" != "AGENTS.md" ]]; then
+              echo "::error::${md} should symlink to AGENTS.md but points to '${target:-<missing>}'"
+              ok=false
+            fi
+          done
+          if ! [[ -d .agents/commands ]]; then
+            echo "::error::.agents/commands/ directory missing"
+            ok=false
+          fi
+          if [[ "${ok}" != "true" ]]; then
+            exit 1
+          fi
+          echo "All agent config symlinks verified."
+
+  dependency-dag:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+
+      - name: Verify require graph is a DAG (no cycles)
+        run: |
+          # Build the set of project-internal modules from *.el filenames.
+          declare -A project_modules
+          for f in *.el; do
+            mod="${f%.el}"
+            project_modules["${mod}"]=1
+          done
+
+          # Parse (require 'foo) from each file and build an adjacency list.
+          # Only track edges where both ends are project-internal.
+          declare -A edges  # edges["a"]="b c" means a requires b and c
+          for f in *.el; do
+            mod="${f%.el}"
+            deps=""
+            while IFS= read -r dep; do
+              if [[ -n "${project_modules[$dep]+x}" ]]; then
+                deps="${deps} ${dep}"
+              fi
+            done < <(sed -n "s/^.*(require '\\([a-zA-Z0-9_-]*\\)).*/\\1/p" "$f")
+            edges["${mod}"]="${deps}"
+          done
+
+          # DFS cycle detection.
+          declare -A color  # white=unvisited, gray=in-stack, black=done
+          found_cycle=""
+          cycle_path=""
+
+          dfs() {
+            local node="$1"
+            local path="$2"
+            color["${node}"]="gray"
+            for neighbor in ${edges["${node}"]}; do
+              if [[ "${color[$neighbor]:-white}" == "gray" ]]; then
+                found_cycle=1
+                cycle_path="${path} -> ${neighbor}"
+                return
+              fi
+              if [[ "${color[$neighbor]:-white}" == "white" ]]; then
+                dfs "${neighbor}" "${path} -> ${neighbor}"
+                if [[ -n "${found_cycle}" ]]; then
+                  return
+                fi
+              fi
+            done
+            color["${node}"]="black"
+          }
+
+          for mod in "${!project_modules[@]}"; do
+            if [[ "${color[$mod]:-white}" == "white" ]]; then
+              dfs "${mod}" "${mod}"
+              if [[ -n "${found_cycle}" ]]; then
+                echo "::error::Dependency cycle detected: ${cycle_path}"
+                exit 1
+              fi
+            fi
+          done
+          echo "Dependency graph is a DAG — no cycles found."
+
+  test:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+
+      - uses: actions/checkout@v4
+        with:
+          repository: timvisher-dd/acp.el-plus
+          path: deps/acp.el
+
+      - uses: actions/checkout@v4
+        with:
+          repository: xenodium/shell-maker
+          path: deps/shell-maker
+
+      - uses: purcell/setup-emacs@master
+        with:
+          version: 29.4
+
+      - name: Remove stale .elc files
+        run: find . deps -follow -name '*.elc' -print0 | xargs -0 rm -f
+
+      - name: Byte-compile
+        run: |
+          compile_files=()
+          for f in *.el; do
+            case "$f" in x.*|y.*|z.*) ;; *) compile_files+=("$f") ;; esac
+          done
+          emacs -Q --batch \
+            -L . -L deps/acp.el -L deps/shell-maker \
+            -f batch-byte-compile \
+            "${compile_files[@]}"
+
+      - name: Run ERT tests
+        run: |
+          test_args=()
+          for f in tests/*-tests.el; do
+            test_args+=(-l "$f")
+          done
+          emacs -Q --batch \
+            -L . -L deps/acp.el -L deps/shell-maker -L tests \
+            "${test_args[@]}" \
+            -f ert-run-tests-batch-and-exit
diff --git a/.gitignore b/.gitignore
@@ -1,3 +1,4 @@
 /.agent-shell/
+/deps/
 
 *.elc
diff --git a/AGENTS.md b/AGENTS.md
@@ -17,3 +17,25 @@ When contributing:
 ## Contributing
 
 This is an Emacs Lisp project. See [CONTRIBUTING.org](CONTRIBUTING.org) for style guidelines, code checks, and testing. Please adhere to these guidelines.
+
+## Development workflow
+
+When adding or changing features:
+
+1. **Run `bin/test`.** Set `acp_root` and `shell_maker_root` if the
+   deps aren't in sibling worktrees. This runs byte-compilation, ERT
+   tests, dependency DAG check, and checks that `README.org` was
+   updated when code changed.
+2. **Keep the README features list current.** The "Features on top of
+   agent-shell" section in `README.org` must be updated whenever code
+   changes land. Both `bin/test` and CI enforce this — changes to `.el`
+   or `tests/` files without a corresponding `README.org` update will
+   fail.
+3. **Live-validate rendering changes.** For changes to the rendering
+   pipeline (fragment insertion, streaming, markers, UI), run a live
+   batch session to verify fragment ordering and buffer integrity.
+   See `.agents/commands/live-validate.md` for details. The key command:
+   ```bash
+   timvisher_agent_shell_checkout=. timvisher_emacs_agent_shell claude --batch \
+     1>/tmp/agent-shell-live.log 2>&1
+   ```
diff --git a/CODEX.md b/CODEX.md
@@ -0,0 +1 @@
+AGENTS.md
diff --git a/CONTRIBUTING.org b/CONTRIBUTING.org
@@ -108,6 +108,20 @@ Overall, try to flatten things. Look out for unnecessarily nested blocks and fla
     buffer)
 #+end_src
 
+Similarly, flatten =when-let= + nested =when= by using boolean guard clauses as bindings in =when-let=.
+
+#+begin_src emacs-lisp :lexical no
+  ;; Avoid
+  (when-let ((filename (file-name-nondirectory filepath)))
+    (when (not (string-empty-p filename))
+      (do-something filename)))
+
+  ;; Prefer (use boolean binding as guard clause)
+  (when-let ((filename (file-name-nondirectory filepath))
+             ((not (string-empty-p filename))))
+    (do-something filename))
+#+end_src
+
 ** Prefer =let= and =when-let= over =let*= and =when-let*=
 
 Only use the =*= variants when bindings depend on each other. LLMs tend to default to =let*= and =when-let*= even when there are no dependencies between bindings.
@@ -231,3 +245,20 @@ Tests live under the tests directory:
 Opening any file under the =tests= directory will load the =agent-shell-run-all-tests= command.
 
 Run tests with =M-x agent-shell-run-all-tests=.
+
+*** From the command line
+
+=bin/test= runs the full ERT suite in batch mode.  By default it
+expects =acp.el= and =shell-maker= to be checked out as sibling
+worktrees (e.g. =…/acp.el/main= and =…/shell-maker/main= next to
+=…/agent-shell/main=).  Override the paths with environment variables
+if your layout differs:
+
+#+begin_src bash
+  acp_root=~/path/to/acp.el \
+  shell_maker_root=~/path/to/shell-maker \
+    bin/test
+#+end_src
+
+The script validates that both dependencies are readable and exits
+with a descriptive error if either is missing.