You are here: Quick tour for curious engineers. Next: 1-Hour Deep Dive → | Adoption TL;DR →
Version note: This tour assumes mainline (
v2.2.0+) or Flow Studio milestone (v0.4.1-flowstudio). If UI elements differ from these screenshots, compare your branch to those tags:git diff v0.4.1-flowstudio -- swarm/tools/flow_studio_ui/
See the swarm SDLC in action. No reading required—just click and explore.
# One time only: install dependencies
uv sync --extra dev
# Download demo artifacts
make demo-run
# Start Flow Studio
make flow-studioFlow Studio opens at
http://localhost:5000/?run=demo-health-check&mode=operator.
You're looking at a 7-flow SDLC with 56 agents. Here's what you see:
┌───────────────────────────────────────────────────────────────────┐
│ Flow Studio [?] [👤 operator] │
├──[Signal ✓ Plan ✓ Build ✓ Review ✓ Gate ✓ Deploy ✓ Wisdom ✓]─────┤
├─────┬───────────────────────────────────────────────────┬────────┤
│ │ │ │
│ 7 │ SDLC Flow Graph │Details │
│flows│ (nodes = steps, colors = agents) │ panel │
│ in │ │ │
│side │ Click nodes to see details │ │
│ bar │ │ │
│ │ │ │
└─────┴───────────────────────────────────────────────────┴────────┘
Left sidebar: 7 flows (click to switch) Center: Step and agent graph for the current flow Top bar: SDLC progress across all flows (all green = complete) Right panel: Details for the selected node (3 tabs: Node, Run, Selftest)
Let's walk Build (Flow 3), where code and tests happen.
-
Click "Build" in the left sidebar (or press
3)- You see 9 steps (colored nodes) and their agents (circles)
-
Click any step node (teal boxes labeled like "test-author")
- Right panel shows:
- Node tab: Step ID, flow, role description
- Run tab: Artifact status (what files this step created)
- Selftest tab: Governance checks that apply to this step
- Right panel shows:
-
Switch to Selftest tab
- See selftest summary (how many KERNEL/GOVERNANCE/OPTIONAL checks exist)
- Click "View Full Plan" to see all 16 selftest steps (1 KERNEL, 13 GOVERNANCE, 2 OPTIONAL)
- Click any step to see:
- Why it matters (KERNEL failures block merges)
- What it validates (linting, contracts, coverage, etc.)
- Commands to run it:
uv run swarm/tools/selftest.py --step <step-id>
-
Click an agent node (colored dots around the flow)
- See agent name, category (shaping, spec, implementation, etc.)
- See which model it uses (inherit, haiku, sonnet)
- See all the flows where this agent appears
You just ran a complete SDLC. Here's what each flow did:
Problem shaping: Turned raw input into requirements and BDD scenarios.
- Agents: signal-normalizer, problem-framer, requirements-author, requirements-critic
- Output:
problem_statement.md,requirements.md,bdd_scenarios.feature,risk_assessment.md
Design decisions: Turned requirements into architecture, contracts, and test plans.
- Agents: impact-analyzer, design-optioneer, adr-author, interface-designer, test-strategist, work-planner
- Output:
adr.md,api_contracts.yaml,observability_spec.md,test_plan.md,work_plan.md
Implementation + test loops: Agents wrote code and tests, critics reviewed, mutator tested.
- Key pattern: microloop (test-author ↔ test-critic, code-implementer ↔ code-critic)
- Agents: context-loader, test-author, test-critic, code-implementer, code-critic, mutator, fixer, doc-writer, self-reviewer
- Output: Source code, tests,
build_receipt.json(audit trail of all decisions)
Pre-merge audit: Checked contracts, security, coverage, policy.
- Agents: receipt-checker, contract-enforcer, security-scanner, coverage-enforcer, merge-decider
- Output:
merge_decision.md(approve / bounce / escalate)
Verification: Merged to main, deployed, ran smoke tests.
- Agents: deploy-monitor, smoke-verifier, deploy-decider
- Output:
deployment_log.md,verification_report.md
Learning extraction: Analyzed what went well, what failed, why. Created issues and updated docs.
- Agents: artifact-auditor, regression-analyzer, flow-historian, learning-synthesizer, feedback-applier
- Output:
artifact_audit.md,regression_report.md,learnings.md,feedback_actions.md
Open this URL:
http://localhost:5000/?run=demo-health-check&compare=health-check-risky-deploy
Now you see side-by-side flows:
- health-check: All green (everything worked)
- health-check-risky-deploy: Gate shows red (deployment was risky, merge blocked)
This is how you diagnose why a run failed: visually compare status across all flows.
You now understand:
-
Flows are structured SDLC spec: Not chat, not magic. Each flow has a defined role, inputs, outputs, agent roster.
-
Steps are checkpoints: Each step has agents assigned, artifacts produced, and governance checks attached.
-
Agents are narrow interns: They specialize (test-author writes tests, test-critic reviews them, they iterate until tests pass).
-
Selftest is the governance gate: 16 steps validate everything (linting → contracts → coverage → policy). Failures block merges tier-by-tier (KERNEL blocks all, GOVERNANCE is conditional, OPTIONAL is informational).
-
Receipts are on disk: All decisions, command outputs, timings are in JSON/Markdown under
swarm/runs/<run-id>/, so you can audit everything.
- Deeper dive: Read docs/FLOW_STUDIO.md (detailed walk, 1 hour)
- Use in your repo: Read docs/ADOPTING_SWARM_VALIDATION.md
- Understand the philosophy: Read docs/WHY_DEMO_SWARM.md
- Run it yourself: Read DEMO_RUN.md for a narrative walkthrough
- Use the API: Read docs/FLOW_STUDIO_API.md to integrate Flow Studio data into dashboards
Press ? in Flow Studio to see all shortcuts:
1–6: Jump to flows (Signal through Wisdom)/: Focus search←/→: Navigate between stepsEsc: Close modals?: Show this help
Flow Studio visualizes an agentic SDLC where agents implement 7 flows (Signal → Plan → Build → Review → Gate → Deploy → Wisdom). Each flow has steps (roles), agents (specialists), and artifacts (proof). Selftest validates everything in tiers (KERNEL/GOVERNANCE/OPTIONAL). You review the receipts, not the process.
That's the whole idea in 30 seconds.