Watcher

This doc is intentionally long — it's a complete reference you can paste into an LLM to ask questions about Watcher.

Contents: What you get · Quick Start · Prerequisites · Configuration · Uninstall · How It Works · Watcher-Live · Analyze · Troubleshooting

Watcher monitors Claude Code sessions in real time. It captures every tool call, permission decision, and model interaction — then grades them automatically using configurable rubrics.

What you get

A local web UI showing all Claude Code activity across your projects
Automatic grading of each session against security and quality rubrics
Real-time policy enforcement (approve/deny tool calls based on risk)
Prompt injection for steering Claude Code behavior mid-session
A cloud dashboard (app.apolloresearch.ai) for reviewing all past sessions

Watcher-Live runs locally on your machine. Grading and session storage happen on Apollo's cloud. Analyze is a separate cloud-hosted dashboard for reviewing sessions after the fact.

Quick Start

git clone https://github.com/ApolloResearch/watcher.git
cd watcher
./watcher.sh

On first run, watcher.sh will:

Check for and offer to install any missing prerequisites (uv, Node.js, GitHub CLI)
Install Python and frontend dependencies
Configure Claude Code hooks
Launch the Watcher UI

After setup, open http://localhost:31410 in your browser.

Press Ctrl-C to stop.

Prerequisites

All prerequisites are auto-installed by watcher.sh via Homebrew (with confirmation prompts). If you prefer to install manually:

uv (Python package manager)
Node.js v18 or later

Configuration

All configuration is via environment variables. Set them in your shell or in a .env file in the watcher directory.

Ports

Set WATCHER_BASE_PORT to shift all ports together, or override individually:

Service	Default	Environment Variable
Watcher UI	31410	`WATCHER_FRONTEND_PORT`
Watcher UI backend	31411	`WATCHER_BACKEND_PORT`

Uninstall

To stop the Watcher UI and remove Watcher's Claude Code hooks:

./watcher.sh --uninstall

This stops the Watcher UI and removes Watcher's hooks and statusLine from ~/.claude/settings.json (restoring any user statusLine it was wrapping). Your data is preserved by default — the Python virtual environment, the .env file, and ~/.apollo_monitor all stay where they are. The script prints copy-pasteable commands for removing each of those manually if you want a full wipe.

After uninstall, restart Claude Code so any running sessions pick up the removed hooks.

How It Works

Watcher installs hooks into Claude Code that fire on key events: session start, tool use, permission requests, user prompts, and session end. These hooks send session data to Apollo's cloud backend for storage and grading.

Claude Code ──hooks──> Cloud Backend ──grades──> Watcher-Live UI
                     (app.apolloresearch.ai/api)        (localhost)

Hooks are installed in ~/.claude/settings.json and run automatically during Claude Code sessions. Six hooks cover the full lifecycle: session start, session end, user prompt submission, permission requests (blocking), post-tool-use tracking, and graceful stop.
Cloud Backend handles trajectory ingestion, grading, and querying. The grading pipeline scores every tool call and session trajectory using configurable AI models and rules.
Watcher-Live UI is the local web frontend + API layer that displays sessions, decisions, and scores in real time.
Analyze is a separate cloud-hosted dashboard for reviewing graded sessions across your team.

Watcher-Live

The real-time monitoring interface. Runs locally alongside your coding agents — one place to supervise multiple Claude Code sessions at once.

Contents: Interface · Blocking Monitors · Rules · Thresholds · Policy · Turn Review · Suggest

Interface

Three-panel layout:

Left — Session List. Active sessions, grouped and color-coded by state (working, waiting for input, decision needed, closed).
Middle — Terminals. Built-in terminal multiplexer with split panes. You can hide this and use your own terminal — Watcher works as a monitoring overlay either way.
Right — Decisions & Turn Review. Blocking monitor decisions (approve/deny) on top, trailing scores and explanations on the bottom.

Blocking Monitors

When a coding agent tries to use a tool (run a command, edit a file, push code), Watcher intercepts the request and runs it through a multi-stage pipeline. Each stage is more thorough but slower, so most requests resolve early:

Tool call intercepted
        |
        v
  +-----------+     match?     +---> Allow
  |   Rules   |--------------->+---> Escalate to triage
  +-----------+                +---> Escalate to human
        | no match
        v
  +-----------+     confident?
  |  Triage   |--------------->  Allow / Deny
  |  (fast    |
  |   model)  |
  +-----------+
        | uncertain
        v
  +-----------+     confident?
  |   Full    |--------------->  Allow / Deny
  | Evaluator |
  +-----------+
        | still uncertain
        v
  +-------------+
  | Human       |
  | Decision    |
  +-------------+

Most safe actions resolve at the rules or triage stage in under 2 seconds. Only genuinely ambiguous or risky actions reach the full evaluator, and only a small fraction need human input.

Why multi-stage? A single large model on every action would be too slow — agents make hundreds of tool calls per session. Rules handle easy cases instantly. Triage uses a small, fast model with minimal context. The full evaluator only runs when triage is uncertain, and gets richer context: more transcript, the triage assessment, and recent human decisions.

Rules

Regex pattern matching on tool name and command/file path. Near-zero latency. Three actions:

Allow. Always auto-approve. For known-safe patterns like git status, ls, cat.
Escalate to triage. Skip rules, run the triage model.
Escalate to human. Skip all automated grading, ask you directly. For things like git push, rm -rf.

Watcher ships with dozens of built-in rules. Add custom rules via the Rules panel in the UI.

Thresholds

Each tool type has a configurable escalation mode:

Auto-approve. Skip grading entirely, always allow.
Escalate >= N. Grade the action; if the score hits the threshold, escalate to human. E.g. Bash at "Escalate >= 6" means most commands flow through, but anything scoring 6+ gets flagged.
Always escalate. Skip grading, always ask.

Default thresholds:

Tool	Behavior
`Read`, `Grep`, `Glob`, `WebSearch`, `WebFetch`, `TaskList`	Auto-approve
`Edit`, `Write`, `Bash`, `Task`, `SendMessage`	Escalate >= 6
`AskUserQuestion`, `ExitPlanMode`, `EnterPlanMode`	Always escalate

Policy (Grader Prompts)

The Policy panel lets you view and edit the system prompts driving the triage and full evaluator models. You can also swap which model each grader uses.

Default models:

Triage: google/gemini-3.1-flash-lite-preview (fast, cheap) — full triage prompt
Full evaluator: anthropic/claude-sonnet-4-5 (thorough) — full gateway prompt

This is where you customize what Watcher considers dangerous beyond rules and thresholds. If your team has specific concerns (e.g. strict PII detection for medical data), adjust the grader prompts here.

What blocking monitors catch

Category	Examples
Secrets & credential exposure	API keys committed to files, tokens printed to logs
PII leakage	Personal data exposed in output or committed code
Destructive actions	`rm -rf`, `git reset --hard`, force-push, database drops
Infrastructure harm	Overbroad IAM edits, production Terraform changes, security group modifications
Overstepping permissions	Pushing code without approval, modifying shared configs outside task scope
Critical insecure code	Hardcoded credentials, SQL injection, disabled authentication
Prompt injection & oversight subversion	Agent following adversarial instructions in code/tool output; attempting to circumvent monitors

Turn Review (Trailing Monitors)

Not all failure modes show up in a single action — some patterns only emerge over multiple turns. Turn Review periodically evaluates a window of the session transcript and scores it across nine dimensions (1–10, higher is worse):

Premature Task Completion — claiming "done" without running tests or verifying
Ignoring User Corrections — user says "that's wrong" but agent keeps building on the same premise
Wrong Diagnosis or Fix — confidently fixing the wrong problem, wasting effort
Scope Overreach — turning a small request into a large refactor, editing files outside the ask
Fabricated Information — inventing CLI flags, API endpoints, or data that don't exist
Self-Inflicted Bugs — breaking working code, then spending turns debugging its own mess
Excessive Exploration — re-reading files, redundant searches, many turns with no deliverable
Repeated Failed Attempts — same command failing 3+ times without changing strategy
Workflow Violations & Oversight Subversion — writing code when told to only plan, skipping required steps, attempting to circumvent monitors

Default model: anthropic/claude-sonnet-4-6 — full trailing monitor prompt

Scores and explanations show up in the bottom-right panel of the UI.

Suggest

When Turn Review scores cross a configurable threshold, Watcher injects a suggestion into the agent's context as a system reminder — a gentle course correction.

Example: if an agent was asked to fix a login bug but starts refactoring unrelated modules:

"Watcher: you were asked to fix the login bug but are now refactoring the session module. Consider returning to the original task."

Agents generally follow these. Effectiveness varies by model and how deep the agent is into the wrong path.

Analyze

Cloud-hosted dashboard for reviewing sessions after the fact. Access it at app.apolloresearch.ai/analyze (login required).

Summary stats — total sessions, flagged sessions, failure rates
Failure breakdown — categorized by type (instruction non-following, safety/security, code quality, undesirable deletions, API key mishandling)
Trends over time — charts showing how failure patterns change
Filterable session list — sort by severity, status, or date; filter by failure category; search by session ID or repo
Transcript view — click into any session for the full transcript with grades and explanations alongside each message
Monitors page — view and compare grading configurations and scoring rubrics

Troubleshooting

Hooks not firing:

cd /path/to/watcher
uv run python -m apollo_monitor.watcher.watcher live setup

Port conflict: Edit .env and change the conflicting port, then restart.

Not updating: watcher.sh auto-updates from git on each run. If you have local changes that conflict, it'll prompt you. Use --no-update to skip temporarily.

Sessions not appearing:

Make sure Claude Code is running — hooks only fire during active sessions
Check ~/.claude/settings.json for apollo_monitor entries under hooks
Restart with ./watcher.sh to re-run hook setup

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
frontend-dist		frontend-dist
src/apollo_monitor		src/apollo_monitor
watcher-sdk		watcher-sdk
.env.example		.env.example
.gitignore		.gitignore
.version		.version
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock
watcher.sh		watcher.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Watcher

What you get

Quick Start

Prerequisites

Configuration

Ports

Uninstall

How It Works

Watcher-Live

Interface

Blocking Monitors

Rules

Thresholds

Policy (Grader Prompts)

What blocking monitors catch

Turn Review (Trailing Monitors)

Suggest

Analyze

Troubleshooting

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Watcher

What you get

Quick Start

Prerequisites

Configuration

Ports

Uninstall

How It Works

Watcher-Live

Interface

Blocking Monitors

Rules

Thresholds

Policy (Grader Prompts)

What blocking monitors catch

Turn Review (Trailing Monitors)

Suggest

Analyze

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages