Execution Model

Taskplane task execution is a persistent-context loop with file-backed memory.

Core idea:

each worker iteration starts with fresh model context
the worker handles all remaining steps in a single context
the worker drives reviews inline via the review_step tool
STATUS.md is the persistent execution memory
progress is checkpointed continuously

Lifecycle overview

/orch <task or area>
  → allocate lane (isolated git worktree)
  → parse task
  → load or generate STATUS.md
  → iteration loop:
      spawn worker with all remaining steps
      worker works through steps in order:
        - plan review (via review_step tool, if level ≥ 1)
        - implement step
        - commit changes
        - code review (via review_step tool, if level ≥ 2)
        - if REVISE: address feedback, commit fixes
        - proceed to next step
      after worker exits, check what was completed
      if all steps complete → break
      if context limit hit → next iteration picks up from incomplete step
  → (optional) quality gate review
  → create .DONE
  → merge into orch branch

Phase 1: Task initialization

When a task starts executing in a lane:

Resolve and parse PROMPT.md
Load config (JSON first, YAML fallback)
Ensure STATUS.md exists (generate if missing)
Ensure .reviews/ directory exists
Enter running phase
Context window auto-detected from pi model registry (v0.8.0+)

If STATUS.md already exists, review counter and iteration values are rehydrated.

Phase 2: Step execution

Steps are parsed from ### Step N: ... headings.

The worker is spawned once per iteration and told to work through all remaining (incomplete) steps in order. This preserves accumulated context across step boundaries, avoiding the re-hydration cost of spawning a fresh worker per step.

Each iteration:

Identify all incomplete steps
Spawn worker with the full list of remaining steps
Worker works through steps sequentially:
- Calls review_step(type="plan") before implementing (if review level ≥ 1)
- Implements the step
- Commits at step boundary
- Calls review_step(type="code") after implementing (if review level ≥ 2)
- If REVISE: reads feedback, addresses issues, commits fixes
- Proceeds to next step
Worker exits (naturally, via wrap-up signal, or context limit)
Runner determines which steps were newly completed
If all steps complete, task is done; otherwise start next iteration

Worker-driven reviews (v0.9.0+)

Reviews are driven by the worker agent via the review_step extension tool. The worker decides when to review based on the task's review level. The reviewer runs as a persistent agent — one per task, staying alive across all reviews via the wait_for_review tool. This preserves the reviewer's context across step reviews. If the persistent session dies, it falls back to a fresh spawn.

Review Level 0: No reviews
Review Level 1: Plan review before implementing each step
Review Level 2: Plan review + code review after implementing
Review Level 3: Plan + code + test reviews

Low-risk step exception: Step 0 (Preflight) and the final step (Documentation & Delivery) always skip reviews. The worker template instructs this and the tool handler enforces it as a safety net.

See Review Loop for full details.

Worker iteration loop

Each iteration:

Re-read STATUS.md
Determine all remaining incomplete steps
Spawn worker agent with task context + project context + remaining steps list
Worker works through steps in order, invoking reviews inline
Worker updates STATUS.md and checkpoints changes continuously
Runner checks total progress across all steps after worker exits

Guardrails:

max_worker_iterations
no_progress_limit (checked per iteration across all steps)
context pressure thresholds (warn_percent default 85%, kill_percent default 95%)
optional wall-clock cap (max_worker_minutes, default 120 min)

If no progress repeats beyond limit, the task is marked blocked/error.

Supervisor-in-the-loop exit interception (TP-172)

When a worker exits without making visible progress (no checkboxes checked, no blocker logged), the lane-runner can intercept the exit before closing the worker's process. The worker's conversation context is preserved while the lane-runner escalates to the supervisor with the worker's last assistant message, current step, and unchecked items.

The supervisor can then:

Send targeted instructions via send_agent_message → the worker continues with its full conversation context plus the new guidance
Reply "skip" or "let it fail" → the session closes normally

Each session can be intercepted at most 2 times (configurable via maxExitInterceptions). After the limit, or if the supervisor doesn't respond within 60 seconds, the session closes and the normal corrective re-spawn mechanism takes over.

Context window auto-detect (v0.8.0+)

The worker's context window is auto-detected from pi's model registry. For Claude 4.6 Opus, this is 1M tokens; for Bedrock variants, 200K. The hardcoded 200K default is only a fallback when pi doesn't report the model's context size. Users can still override via worker_context_window in config.

Context overflow recovery

If the worker hits the context limit mid-task, it exits and the next iteration picks up from the first incomplete step via STATUS.md — the same recovery mechanism as any other worker exit, just triggered by context pressure instead of natural completion.

Model fallback recovery (TP-055)

When a configured model becomes unavailable mid-batch (401/403/429, model deprecated, API key expired), the exit is classified as model_access_error. If taskRunner.modelFallback is "inherit" (default), the orchestrator automatically retries the task with the session model — one attempt only. If the session model also fails, normal failure handling applies.

STATUS.md as persistent memory

STATUS.md is the durable source of truth for:

current step
checkbox state
review counter
iteration count
execution log

Because state is on disk, execution can be paused/resumed and recovered across session restarts.

Checkpoint discipline

Taskplane's worker prompt enforces checkpoint behavior:

complete one checkbox item
update STATUS checkbox
commit checkpoint at step boundaries

This makes progress granular, auditable, and recoverable.

Pause and resume

/orch-pause: sets pause signal; current tasks finish before pausing
/orch-resume [--force]: restarts from persisted state
On batch failure, the supervisor can resume programmatically via the orch_resume tool

Completion semantics

A task is complete when the worker finishes all steps and writes:

<task-folder>/.DONE

Quality gate (opt-in)

When the quality_gate config is enabled, a structured review runs after all steps complete but before .DONE creation. A cross-model review agent evaluates the task output and produces a JSON verdict (PASS or NEEDS_FIXES) with severity-classified findings.

PASS: .DONE is created normally.
NEEDS_FIXES: A remediation cycle begins — a fix agent addresses blocking findings, then the review reruns. This repeats up to the configured cycle limits (max_review_cycles, max_fix_cycles).
Cycles exhausted: If the maximum cycles are reached without a PASS, the task enters error state. .DONE is not created.
Fail-open: If the review agent crashes, times out, or produces malformed/missing output, the verdict defaults to PASS so infrastructure failures never block task completion.

When disabled (default), .DONE is created immediately after all steps complete — no behavioral change.

See task-runner.yaml Reference for configuration details.

In orchestrated mode, the runner creates .DONE and lets the orchestrator handle the merge lifecycle. When all tasks in a wave complete, an LLM-powered merge agent combines the lane branches into the orch branch — resolving any conflicts semantically and running verification tests. See Merge and Conflict Resolution for details.

Failure semantics

Tasks can enter error phase due to:

parse failures
worker/reviewer spawn errors
no-progress threshold exceeded
iteration limits exceeded
explicit runtime errors

Status and logs remain on disk for diagnosis. The supervisor agent can diagnose failures and offer recovery options.

Why persistent-context loops

The persistent-context model (v0.8.0+) spawns one worker per task instead of per step. The worker maintains full context across step boundaries, eliminating costly re-hydration. If the context window is exhausted mid-task, the iteration mechanism provides a clean recovery path via STATUS.md.

Tradeoff:

workers use more of the context window per iteration
but dramatically fewer spawns and lower token cost
reviews happen inline with full context (worker addresses REVISE immediately)

Waves, Lanes, and Worktrees — parallel execution and worktree isolation
Merge and Conflict Resolution — how parallel task output is combined
Review Loop
Task Format Reference
Commands Reference
Persistence and Resume

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Execution Model

Lifecycle overview

Phase 1: Task initialization

Phase 2: Step execution

Worker-driven reviews (v0.9.0+)

Worker iteration loop

Supervisor-in-the-loop exit interception (TP-172)

Context window auto-detect (v0.8.0+)

Context overflow recovery

Model fallback recovery (TP-055)

STATUS.md as persistent memory

Checkpoint discipline

Pause and resume

Completion semantics

Quality gate (opt-in)

Failure semantics

Why persistent-context loops

Related

FilesExpand file tree

execution-model.md

Latest commit

History

execution-model.md

File metadata and controls

Execution Model

Lifecycle overview

Phase 1: Task initialization

Phase 2: Step execution

Worker-driven reviews (v0.9.0+)

Worker iteration loop

Supervisor-in-the-loop exit interception (TP-172)

Context window auto-detect (v0.8.0+)

Context overflow recovery

Model fallback recovery (TP-055)

STATUS.md as persistent memory

Checkpoint discipline

Pause and resume

Completion semantics

Quality gate (opt-in)

Failure semantics

Why persistent-context loops

Related