agentplane/docs/developer/architecture.mdx at main · basilisk-labs/agentplane

title	Architecture
description	How the framework control plane, CLI, backends, runner, tasks, and recipes fit together.

What runs where

Agent Plane has three layers with different responsibilities:

CLI runtime (packages/agentplane/src)
Shared models and config (packages/core/src)
Generated schema/spec layer (packages/spec/schemas)

The runtime layer owns command behavior. The core layer owns parsing, defaults, validators, and shared contracts. The spec layer is a generated mirror for tooling and external consumers, synchronized from runtime code into both packages/spec/schemas and packages/core/schemas.

Durable config and task artifact contracts now use Zod in packages/core/src/** as the single source of truth, with generated JSON Schema mirrors emitted for tooling and external consumers.

For the architectural decision and practical boundary guidance, see ADR: Zod Contract SSOT and Schema validation strategy.

Current priority

This page describes the current implementation surfaces. It is not an endorsement of expanding recipe, runner, or prompt-assembly scope immediately.

For the active 0.3 line, the priority is to harden existing task, workflow, PR/integrate, backend, and CLI contracts first. Deeper prompt-assembly and recipe-domain redesign remains deferred until that foundation work settles.

Framework control plane

The framework layer is the reusable control plane that recipes and runner-specific code now consume instead of rebuilding local helper state.

Harness resolution:
- packages/agentplane/src/runtime/harness/*
- Produces ResolvedHarnessContract from repo policy gateway, config, workflow mode, task contract, backend restrictions, and protected-path rules.
Canonical execution context:
- packages/agentplane/src/runtime/execution-context.ts
- Produces AgentplaneExecutionContext / ReadOnlyUsecaseContext with repo, task backend, harness, approvals, capability registry, execution-profile runtime, task-intake runtime, explain payloads, and protocol surfaces.
Behavior precedence and traces:
- packages/agentplane/src/runtime/behavior/*
- Formal precedence is harness -> extension -> user -> builtin, with traceable winners and conflicts.
Capabilities:
- packages/agentplane/src/runtime/capabilities/*
- Unifies backend- and runner-facing capability projection, including unavailable/blocked semantics.
Governance runtime:
- packages/agentplane/src/policy/*
- packages/agentplane/src/runtime/approvals/*
- packages/agentplane/src/runtime/execution-profile/*
- Makes approvals, risky actions, budgets, stop/handoff conditions, trace policy, and timeout policy explicit runtime inputs.
Task intake:
- packages/agentplane/src/runtime/task-intake/*
- Defines TaskIntakeContext, clarification contracts, graph drafts, and materialization plans without binding them to a specific recipe or runner adapter.
Explain and protocol:
- packages/agentplane/src/runtime/explain/*
- packages/agentplane/src/runtime/protocol/*
- Produces machine-readable explain payloads plus additive, versioned protocol/result envelopes that can be persisted and extended without replacing the core model.

CLI runtime map

Command entry and global argument handling:
- packages/agentplane/src/cli/run-cli.ts
Command registry and lazy loading:
- packages/agentplane/src/cli/run-cli/command-catalog.ts
Command handlers:
- packages/agentplane/src/commands/**
Shared module topology:
- Module topology
Task backend integration:
- packages/agentplane/src/backends/task-backend/load.ts
Redmine backend implementation:
- packages/agentplane/src/backends/task-backend/redmine-backend.ts

Core map

Project resolution:
- packages/core/src/project/
Config parsing and defaults:
- packages/core/src/config/
Task document and snapshot primitives:
- packages/core/src/tasks/

Runtime data flow

CLI resolves project root, runtime mode, and .agentplane/config.json.
runtime/harness resolves the repo-level contract into ResolvedHarnessContract.
runtime/execution-context.ts assembles the canonical execution context around that harness.
Behavior, capability, approval, execution-profile, task-intake, explain, and protocol modules derive reusable runtime inputs from the canonical context.
Command handlers and runner usecases consume the same context instead of rebuilding ad hoc task/repo/config helpers.
Backend mutation and task projection update .agentplane/tasks/<task-id>/README.md and optional exported caches such as .agentplane/tasks.json.
Shared runner flows persist run artifacts under .agentplane/tasks/<task-id>/runs/<run-id>/, including machine-readable framework explain/protocol payloads in the bundle.

Task file decision

Current target:

Keep one file per task.
Keep .agentplane/tasks/<task-id>/README.md as the canonical container for now.
Treat frontmatter as the canonical task state, including revision, sections, status, and lifecycle metadata.
Treat the visible markdown body as a generated projection, not as the operational write surface.

Why README stays the target for now:

It preserves the current human-first workflow: one readable task file, stable diffs, and no split between structured state and narrative view.
Canonical frontmatter already gives the system the structured write surface it needed for generated body, migration, and revision groundwork.
Moving to single-file YAML now would impose migration and tooling cost without solving a correctness problem that the current README-plus-frontmatter model still has.

Revisit a future task.yaml target only if at least one of these becomes true:

Frontmatter grows large enough that task files become materially harder to scan than the generated body helps.
Most task mutations become machine-driven and the rendered markdown view stops being the primary human surface.
Non-local backends or external integrations need a stricter structured payload than YAML frontmatter inside README can express cleanly.
Projection repair, normalization, or container-specific drift starts costing more complexity than a format migration would remove.

Harness engineering loop

Agent Plane architecture is designed for an iterative control loop:

Constrain: define scope and policy via contracts/config.
Execute: run small, explicit command steps.
Observe: capture structured artifacts and diagnostics.
Recover: use deterministic fallback/restore paths.
Integrate: merge verified increments quickly.

This loop keeps autonomy useful while preserving operator control and repository legibility.

Upgrade and policy flow

agentplane upgrade applies managed framework assets and can emit upgrade review artifacts.

In agent mode, run artifacts are created under .agentplane/.upgrade/agent/<runId>/ only when a plan is needed (changes to add/update).
When there are no managed changes, agent mode exits via fast no-op and does not create per-run artifacts.
Auto mode uses replace-first behavior for managed files; .agentplane/policy/incidents.md is append-only when local content already exists.

Policy enforcement now resolves through the framework layer and is then consumed by command handlers, runner flows, and guard paths:

ResolvedHarnessContract defines workflow, task, policy, and backend restrictions.
PolicyEngine classifies actions and ApprovalRuntime evaluates gated behavior such as network, fs, git, config, and force mutations.
ResolvedExecutionProfileRuntime carries approval escalation, budgets, stop/handoff conditions, trace policy, and timeout policy.
Runner and recipe-adjacent flows receive the same resolved policy inputs through the canonical context and persisted runner bundle.

The existence of these shared surfaces should not be read as an active mandate to broaden the runner or recipe architecture during 0.3 stabilization.

Release flow

Release is split into planning and apply:

agentplane release plan generates patch candidate and notes inputs.
agentplane release apply updates versions and changelog artifacts.
Local prepublish gates must pass before --push.
npm publish is GitHub-only with provenance enforced by package scripts and workflow checks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What runs where

Current priority

Framework control plane

CLI runtime map

Core map

Runtime data flow

Task file decision

Harness engineering loop

Upgrade and policy flow

Release flow

Related docs

FilesExpand file tree

architecture.mdx

Latest commit

History

architecture.mdx

File metadata and controls

What runs where

Current priority

Framework control plane

CLI runtime map

Core map

Runtime data flow

Task file decision

Harness engineering loop

Upgrade and policy flow

Release flow

Related docs