Wave Development Guidelines

Wave is a multi-agent pipeline orchestrator written in Go that wraps Claude Code and other LLM CLIs via subprocess execution. It composes personas, pipelines, contracts, and relay/compaction into a continuous development system.

ACCOUNTABILITY — YOU FOUND IT, YOU FIX IT

URGENT — NON-NEGOTIABLE: This is the single most important rule in this file. It survives context compaction. Re-read it if you are unsure.

If you discover a problem — any problem — you own it. Fix it immediately.

There is NO concept of "pre-existing issue" in this project. If you touched it or saw it break — fix it.

If a validation step reveals issues in files you didn't modify — fix those too.

Never argue about whether something is your responsibility.

Critical Constraints

Single static binary — no runtime dependencies except adapter binaries
Test ownership — every failing test is YOUR concern. Fix or delete (with justification), never ignore. Changes to personas, pipelines, contracts, or meta-pipelines require go test ./...
Security first — all inputs validated, paths sanitized, permissions enforced
Constitutional compliance — navigator-first architecture, fresh memory at step boundaries, contract validation at handovers, ephemeral workspace isolation, observable progress events
Observable execution — structured progress events for monitoring
No backward compatibility constraint during prototype phase — move fast, let tests catch regressions
No t.Skip() without a linked issue. Delete tests only with clear justification

How Wave Works at Runtime

Each pipeline is a topologically-sorted DAG of steps. For every step:

Workspace creation — an ephemeral worktree is created under .agents/workspaces/<pipeline>/<step>/. Steps can share workspaces via workspace.ref. Mounts support readonly/readwrite modes
Artifact injection — outputs from prior steps are injected into .agents/artifacts/ before execution begins. The system validates existence, enforces optional/required semantics, and checks schemas if ref.SchemaPath is specified
Runtime CLAUDE.md assembly — a per-step CLAUDE.md is generated from four layers:
- Base protocol preamble (.agents/personas/base-protocol.md)
- Persona system prompt (role, responsibilities, constraints)
- Contract compliance section (auto-generated from step contract schema)
- Restriction section (denied/allowed tools, network domains from manifest permissions)
Adapter execution — the persona runs in isolated context with fresh memory (no chat history inheritance)
Contract validation — step output is validated against its contract (json_schema, typescript_interface, test_suite, markdown_spec, format) before marking the step successful. Hard failures block; soft failures log warnings

Key source files: internal/pipeline/executor.go, internal/adapter/claude.go, internal/contract/, internal/workspace/

Architecture

Active Technologies

Go 1.25+ with gopkg.in/yaml.v3, github.com/spf13/cobra
SQLite for pipeline state, filesystem for workspaces and artifacts

Core Components

Manifests (wave.yaml) — single source of truth for configuration
Personas — AI agents with specific roles, permissions, and system prompts
Pipelines — multi-step workflows with dependency resolution
Contracts — output validation (JSON schema, TypeScript, test suites)
Workspaces — ephemeral isolated execution environments
State Management — SQLite-backed persistence and resumption

Security Model

Fresh memory at every step boundary — no chat history inheritance
Permission enforcement with deny/allow patterns — strictly enforced
Ephemeral workspaces — isolated filesystem execution
Contract validation — all outputs validated before step completion
Audit logging — credential scrubbing and tool call tracking

File Structure

internal/
├── adapter/      # Subprocess execution and adapter management
├── attention/    # Attention classifier driving dashboard notification badges
├── audit/        # Audit logging and credential scrubbing
├── bench/        # SWE-bench benchmarking and comparison
├── checks/       # Shared host-capability probes for preflight and doctor
├── classify/     # Issue domain/complexity classification and pipeline selection
├── continuous/   # Continuous pipeline execution
├── contract/     # Output validation (JSON, TypeScript, test suites)
├── cost/         # Cost ledger, iron rule enforcement, model pricing
├── defaults/     # Embedded default personas, pipelines, and contracts
├── display/      # Terminal progress display and formatting
├── doctor/       # Project health checking and optimization
├── event/        # Progress event emission and monitoring
├── fileutil/     # File and directory copy helpers
├── forge/        # Git forge/hosting platform detection (GitHub, GitLab, Gitea, Forgejo, Codeberg, Bitbucket, local)
├── github/       # GitHub API integration for issue enhancement
├── hooks/        # Lifecycle hooks and webhook delivery runner
├── humanize/     # Human-readable formatting helpers (durations, counts, sizes)
├── manifest/     # Configuration loading and validation
├── onboarding/   # Interactive wave init flow (monorepo-aware, Docker compose, flavour detection)
├── pathfmt/      # Path formatting and normalization utilities
├── pipeline/     # Pipeline execution, step management, model routing, decision logging
├── preflight/    # Pipeline dependency validation and auto-install
├── recovery/     # Pipeline recovery hints and error guidance
├── relay/        # Context compaction and summarization
├── retro/        # Run retrospective generation
├── sandbox/      # Docker and bubblewrap sandbox backends
├── scope/        # Persona token scope parsing and validation
├── security/     # Security validation and sanitization
├── skill/        # Skill discovery, provisioning, and command management
├── state/        # SQLite persistence, outcome tracking, webhooks, decision log
├── suggest/      # Pipeline suggestion engine
├── testutil/     # Shared test helpers (event collectors, state mocks, manifest fixtures)
├── timeouts/     # Default timeout constants (leaf package, breaks import cycles)
├── tools/        # Shared CLI tool availability checks
├── tui/          # Bubble Tea terminal UI
├── webui/        # Web operations dashboard (runs, pipelines, webhooks, admin, analytics)
├── worktree/     # Git worktree lifecycle for isolated workspaces
└── workspace/    # Ephemeral workspace management

cmd/wave/         # CLI command structure
tests/            # Test coverage
.agents/            # Default personas, pipelines, contracts

Active Adapters

Adapter	Binary	Model Format	Instruction File
Claude	`claude`	Short names or full IDs (`sonnet`, `haiku`)	`CLAUDE.md`
OpenCode	`opencode`	`provider/model` (`zai-coding-plan/glm-5-turbo`)	`AGENTS.md`
Gemini	`gemini`	Plain names (`gemini-2.0-pro`)	`GEMINI.md`
Codex	`codex`	`provider/model` or plain names	`AGENTS.md`

Override Hierarchy (strongest to weakest)

Tier	Source	Adapter	Model
1	CLI flags	`--adapter <name>`	`--model <model>`
2	Pipeline step YAML	`adapter:`	`model:`
3	Persona in wave.yaml	`adapter:`	`model:`
4	Adapter defaults	binary default	empty / auto-route

wave run my-pipeline --adapter opencode --model "zai-coding-plan/glm-5-turbo"
wave run my-pipeline --adapter gemini --model "gemini-2.0-pro"

Security

All user input sanitized for prompt injection; file paths validated against approved directories
Persona permissions strictly enforced at runtime; deny rules projected into settings.json AND runtime CLAUDE.md
Outer sandbox: Nix dev shell with bubblewrap (read-only FS, hidden $HOME, curated env)
Adapter sandbox: settings.json sandbox settings with network domain allowlisting
Prompt restrictions: runtime CLAUDE.md restriction section generated from manifest
Environment hygiene: only runtime.sandbox.env_passthrough vars reach subprocesses
No credentials on disk; sanitized logging; workspace isolation prevents data leakage

Development

Code Standards

Follow effective Go practices (gofmt, go vet), single responsibility per package
Use interfaces for testability and dependency injection
Comprehensive error types with structured details
Table-driven tests with edge case coverage

Testing

go test ./...            # Run all tests
go test -race ./...      # Run with race detector (required for PR)
golangci-lint run ./...  # Run static analysis linters

See docs/migrations.md for database migration documentation.

Git Commits

No Co-Authored-By — never include Co-Authored-By lines in commit messages
No AI attribution — do not add "Generated with Claude Code" or similar
Use conventional commit prefixes: feat:, fix:, docs:, refactor:, test:, chore:

Versioning

Automated semantic versioning from conventional commits. Every merge to main produces a release.

Commit prefix	Bump	Example
`fix:`, `docs:`, `refactor:`, `test:`, `chore:`	patch (0.0.X)	v0.1.0 → v0.1.1
`feat:`	minor (0.X.0)	v0.1.1 → v0.2.0
`BREAKING CHANGE:` or `!:` (e.g. `feat!:`)	major (X.0.0)	v0.2.0 → v1.0.0

Debugging

Use --debug flag for detailed execution logging
Check .agents/traces/ for audit logs
Workspace contents preserved for post-mortem analysis

Wave Swarm Orchestration

When acting as the core orchestrator (the Claude instance steering Wave pipelines), follow these patterns:

Pipeline Selection

Fleet after the 2026-04 WLP (ADR-011) consolidation + ops-pr-respond showcase: 39 pipelines under .agents/pipelines/ (28 default-embedded under internal/defaults/pipelines/). All shipped pipelines satisfy the 7 WLP rules (typed I/O, explicit output types, deterministic contracts, canonical artifact paths, sub-pipeline composition, iterate over typed collections, input_ref.from field navigation).

Issue complexity	Pipeline	When to use
Bug fix, small tweak	`impl-issue`	Single-file or few-file changes, clear scope
Medium feature	`impl-issue`	Well-scoped feature with clear acceptance criteria
Complex feature	`impl-speckit`	Multi-component changes, needs spec → plan → tasks → impl
Architecture change	`impl-speckit`	Touches 5+ files, needs design discussion
Divergent-convergent simplification	`impl-recinq`	Double Diamond code simplification
Epic runner	`ops-epic-runner`	Scope an epic and sequentially implement every child
Feature inception (scope + research + impl + review)	`inception-feature`	End-to-end feature delivery composed from core blocks
Bug inception	`inception-bugfix`	End-to-end bug triage + fix + review
Doc inception	`inception-doc`	End-to-end doc fix composition
Audit inception	`inception-audit`	Triage + merge of multiple audit passes
Hardening inception (Wave self)	`inception-harden`	Release-hardening for Wave itself; not shipped in defaults (composes wave-* self-evolution sub-pipelines)
Code quality	`audit-duplicates`, `audit-tests`, `audit-architecture`	Analysis passes feeding findings_report
Dead code / docs	`audit-dead-code-scan`, `audit-doc-scan`	Single-step scans used as Lego blocks
Security	`audit-security`, `wave-security-audit`	Security scanning (any project / Wave itself)
PR review	`ops-pr-review`	Always run before merging any PR; close the loop with `ops-pr-respond` when fixes are wanted
PR review-to-resolution	`ops-pr-respond`	Six-axis parallel review → triage → per-finding fix (sub-pipeline `impl-finding`) → verify → structured PR comment with finding-→-SHA mapping. Showcase pipeline that exercises most Wave primitives (composition, parallel iterate, aggregate, loop, json_schema, agent_review, test_suite); branch primitive is documented in spec but deferred — see spec.md.
Per-finding fix block	`impl-finding`	Sub-pipeline used by `ops-pr-respond` to apply one triaged finding on a PR head branch
Audit-only issue (analysis → doc + screenshots + follow-ups + PR)	`audit-issue`	Pick this instead of `impl-issue` whenever the issue's deliverable is analysis (file:line tables, annotated screenshots, runtime examples) and the result should be a checked-in audit doc plus auto-filed follow-up issues, not a code-→-PR delivery. Composition mirror of `ops-pr-respond` for the audit shape: fetch-issue → 4-axis parallel evidence (webui screenshots, code walk, db trace, event trace) → aggregate → enumerate-gaps → per-gap deepdive (`gap-analyze` sub-pipeline) → synthesize with agent_review against `audit-doc-criteria.md` → file-each-followup (iterate.serial, `gh issue create`) → create-pr. Deliverables: `docs/<slug>-audit.md` with inlined screenshots, one filed follow-up issue per high/critical gap (URLs in `followup-refs.json`), and a PR linking the doc to the follow-ups.
Per-gap deepdive block	`gap-analyze`	Sub-pipeline used by `audit-issue` to refine severity and draft a typed `followup-spec` for one gap. Analog of `impl-finding` for the audit shape.
Audit evidence axes (sub-pipelines)	`audit-webui-shots`, `audit-code-walk`, `audit-db-trace`, `audit-event-trace`	Single-step evidence-axis pipelines composed by `audit-issue`. Run any standalone for ad-hoc inspection (e.g. `wave run audit-code-walk --input "internal/webui/composition.go:520"`).
Wave evolution	`wave-audit`, `wave-scope-audit`, `wave-test-hardening`	Self-evolution of Wave
Wave validation	`wave-validate`, `wave-smoke-gates`, `wave-smoke-contracts`	Smoke tests for gates + contracts
Epic decomposition	`plan-scope`	Decompose an epic into child issues
Issue research	`plan-research`	Research an issue and post findings
Task planning	`plan-task`	Break down a scoped task
Greenfield scaffolding	`ops-bootstrap`	Cold-start onboarding for a new project
PR review core	`ops-pr-review-core`	Core review block composed by wrappers
Issue quality	`ops-issue-quality`	Quality triage of an issue
Parallel audit	`ops-parallel-audit`	Run multiple audits in parallel and aggregate
Smoke: hello world	`ops-hello-world`	Baseline smoke test
Doc explain	`doc-explain`	Produce an explanatory document
Doc onboard	`doc-onboard`	Produce onboarding documentation

PR Review-Then-Merge Protocol

MANDATORY: Never merge a PR without a review. Before launching any pipeline, check existing state first to avoid redundant work.

Check PR state first: gh pr view <N> --json reviews,comments — if a completed ops-pr-review already posted a review, do NOT re-run the review pipeline
Check for existing pipeline runs: wave list runs --limit 20 — if a review run already completed for this PR, skip re-running
Only launch review if no prior review exists: wave run -v ops-pr-review -- "<PR-URL>" &
Wait for review completion and check results
Only merge after review passes
Check for leaked files: gh pr diff <N> --name-only | grep -E "^\.claude/|^\.agents/artifacts/|^\.agents/output/"

Concurrency Rules

Maximum 6 concurrent pipelines — beyond this, API rate limits and CPU contention degrade quality
Optimal: 3-5 concurrent pipelines — best throughput-to-quality ratio
Launch pipelines via wave run --detach -v <pipeline> -- "<input>" (detached — survives shell exit)
Monitor with wave list runs --limit N and wave logs <run-id>

Issue Triage

Before launching a pipeline for an issue:

Check if already implemented — search codebase for the feature
Check for duplicates/superseded — compare with other open issues
Assess implementability — implementable: false is correct behavior, not a failure
Close non-actionable issues with a comment explaining why

Category	Action	Pipeline
close	Already implemented, superseded, or duplicate	Close with comment
implement	Well-scoped, single-PR implementation	`impl-issue`
impl-speckit	Complex, needs spec → plan → tasks → implement	`impl-speckit`
defer	Needs design discussion, experiment, or blocked	Leave open

Monitoring

wave list runs --limit N                                          # Status overview
wave logs <run-id> | grep -E "completed|running|validating"       # Step transitions
wave logs <run-id> | grep "stream_activity" | tail -3             # Latest activity

Post-Pipeline PR Validation

Check if review already exists: gh pr view <N> --json reviews,comments — skip ops-pr-review if one already completed
If no review exists, run ops-pr-review: wave run -v ops-pr-review -- "<PR-URL>" &
Check for leaked files: gh pr diff <N> --name-only | grep -E "^\.claude/|^\.agents/artifacts/|^\.agents/output/"
After review passes: gh pr merge <N> --merge

Failure Recovery

Merge conflicts: Close old PR, git pull origin main, re-run pipeline
No-op PRs: Close PR, re-run — review pipeline catches these
Contract failures: Pipeline retries automatically
Rate limits: Reduce concurrency, wait, re-run

Batch PR Merge Protocol

Merge oldest/smallest PRs first to minimize cascading conflicts
Check remaining PRs for conflicts after each merge
Close CONFLICTING PRs and re-run from updated main
Pull main after batch: git pull origin main

Custom Pipeline Tips

Rapid prototyping: Use on_failure: skip in contract blocks when creating new custom pipelines. This lets the pipeline complete even without schema files, making iteration fast. Graduate to on_failure: retry once the pipeline stabilizes.
Scaffolding: Use wave pipeline create --name my-pipeline --template impl-hotfix to scaffold from an existing template.
Persona scaffolding: Use wave persona create --name my-persona --template researcher to scaffold a custom persona.

Constraints

NEVER write contract or artifact schemas in prompts. Wave has to parse, validate and inject them properly into the proper pipeline step. Exception: gh pr create --body-file .agents/artifacts/<name> and similar CLI commands that require a literal file path are acceptable — the persona needs the path to pass to external tools.
NEVER duplicate information in step prompts that Wave already injects at runtime. buildContractPrompt() automatically injects: (a) output artifact file paths and write instructions, (b) full contract schema content for json_schema contracts, (c) injected input artifact paths and read instructions (## Available Artifacts section). Therefore prompts must NOT specify: where to write output files, where to read injected artifacts, JSON field enumerations, or output structure that matches a contract schema. Reference artifacts by their as: alias name only (e.g., "Read the greeting_file artifact"), never by path. Prompts should focus on reasoning, constraints, quality bar, and domain-specific guidance. For markdown outputs without a schema contract, describing the expected section structure is acceptable since Wave only injects the file path, not the format.
NEVER pass validations silently. If a validation fails, it must be reported as an error and the step should not complete successfully.
NEVER make bulk-edits to the codebase except it is not functional code; BEWARE personas, pipelines etc. are all functional code and should be edited with the same care as any other code: Individually!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wave Development Guidelines

ACCOUNTABILITY — YOU FOUND IT, YOU FIX IT

Critical Constraints

How Wave Works at Runtime

Architecture

Active Technologies

Core Components

Security Model

File Structure

Active Adapters

Override Hierarchy (strongest to weakest)

Security

Development

Code Standards

Testing

Git Commits

Versioning

Debugging

Wave Swarm Orchestration

Pipeline Selection

PR Review-Then-Merge Protocol

Concurrency Rules

Issue Triage

Monitoring

Post-Pipeline PR Validation

Failure Recovery

Batch PR Merge Protocol

Custom Pipeline Tips

Constraints

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

Wave Development Guidelines

ACCOUNTABILITY — YOU FOUND IT, YOU FIX IT

Critical Constraints

How Wave Works at Runtime

Architecture

Active Technologies

Core Components

Security Model

File Structure

Active Adapters

Override Hierarchy (strongest to weakest)

Security

Development

Code Standards

Testing

Git Commits

Versioning

Debugging

Wave Swarm Orchestration

Pipeline Selection

PR Review-Then-Merge Protocol

Concurrency Rules

Issue Triage

Monitoring

Post-Pipeline PR Validation

Failure Recovery

Batch PR Merge Protocol

Custom Pipeline Tips

Constraints