GitHub - lbruton/specflow: A Model Context Protocol (MCP) server that provides structured spec-driven development workflow tools for AI-assisted software development, featuring a real-time web dashboard and VSCode extension for monitoring and managing your project's progress directly in your development environment.

Spec-driven development with persistent project memory and semantic code intelligence.

View the full interactive about page • Case Study: Forge • Changelog

AI agents forget everything between sessions. They lose decisions, repeat mistakes, and drift from reality. SpecFlow gives them a structured lifecycle, persistent cross-project memory, and semantic code intelligence -- self-hosted core with optional cloud integrations.

Built on Pimzino/spec-workflow-mcp's core engine (sequential spec workflow, real-time dashboard, blocking approval gates). This fork layers extended lifecycle phases, multi-project orchestration, three-tier knowledge architecture, and semantic code search on top.

Four Systems, One Workflow

System	What It Does
SpecFlow (MCP Server)	Spec-driven lifecycle: Requirements → Design → Tasks → Implementation with dashboard approvals at every gate. 6 tools, 10 prompts.
DocVault (Obsidian Vault)	Cross-project knowledge base. One vault serves 8+ repos -- architecture, infrastructure, decisions, issues. Graph visualization + wikilinks.
Code Context (Milvus)	Semantic code search via self-hosted vector database. Search by meaning, not keywords. Forked from Zilliz/claude-context, hardened with timeouts and pinned versions.
Skill System (60+ Skills)	CLAUDE.md stays tiny -- a routing table to skills. Each skill encodes a full workflow: debugging, deployment, PR resolution, infrastructure management.

Three-Tier Memory Architecture

Not everything belongs in one file. Each tier has a purpose and a source of truth ranking.

Tier	System	Role
1	DocVault	Ground truth. Human-curated Obsidian vault. Wins all conflicts.
2	File Memory	Session context. Project-scoped markdown at `~/.claude/projects/*/memory/`.
3	mem0	Episodic recall. Semantic retrieval from session digests. Never authoritative. Cloud API by default; self-hosted fork planned.

Continuous Learning Loop

Every session learns from the previous. This is the single biggest differentiator.

/prime (session start)         /wrap (session end)
  ├─ Index codebase (~15s)       ├─ Cleanup (stale branches, uncommitted work)
  ├─ Read recent digests         ├─ /vault-update (documentation sync)
  ├─ Pull mem0 memories          ├─ /retro (prescriptive lessons → mem0)
  ├─ Check issues + git          └─ /digest-session (JSONL → DocVault + mem0)
  ├─ Optional: --deep mode
  └─ "Here's where you left off"
                                 /audit (on-demand health check)
  Tomorrow's /prime reads          ├─ Code quality + security scan
  today's digest + retro           ├─ Documentation drift detection
  lessons automatically            ├─ Issue staleness check
                                   └─ Actionable remediation report

Spec Workflow Lifecycle

Every non-trivial feature follows the same path. Approvals required at each gate.

/prime (start) → /chat → /discover → /spec → Design → Implement → /audit (health) → /wrap (close)

Bug fast path: /systematic-debugging → issue → fix (skip discovery/spec)
Casual path:   /gsd — no issue, no spec, chore: PR

Parallel Subagent Dispatch

Tasks don't execute sequentially. Each runs through an isolated three-stage pipeline:

Orchestrator reads task
  → Implementer Agent      (fresh context, writes code, tests, commits)
  → Compliance Reviewer    (reads actual code vs requirements)
  → Quality Reviewer       (architecture, error handling, readiness)
  → log-implementation     (record artifacts)
  → mark complete

Tasks with zero file overlap execute concurrently in batches.

Case Study: Forge -- empty repo to deployed production app in 3 hours, 23 tasks across 30 parallel subagents, 8 dashboard approval gates, zero file conflicts.

Code Intelligence

Agents shouldn't grep blindly through your codebase. Four search tiers, cheapest first:

Tier	Engine	Query Style
1	Code Graph Context (Neo4j)	Structural: "what calls this function?"
2	Code Context (Milvus)	Semantic: "find code related to payment processing"
3	Grep / Glob	Literal: exact strings, filenames, identifiers
4	Code Oracle Agent	Deep analysis: combines all sources + AI reasoning

Code Context is a hardened fork of Zilliz's MCP server -- self-hosted Milvus, 30s timeouts, pinned npm versions. No collection limits, full data sovereignty. Embedding generation requires a cloud API (OpenAI or compatible) or a local model via Ollama.

Comparison

Dimension	SpecKit	BMAD	GSD	Taskmaster	mex	Pimzino	SpecFlow
Approval gates	None	Advisory	UAT	None	None	Dashboard	Dashboard + skills
Memory	constitution.md	Git docs	STATE.md	tasks.json	Scaffold	Steering docs	3-tier
Session learning	None	None	None	None	GROW loop	None	/prime → /wrap
Code search	None	None	None	None	None	None	Semantic + structural
Multi-project	Per-repo	Per-repo	Per-repo	Per-repo	Per-repo	Per-repo	One vault, all repos
Infrastructure	Code only	Code only	Code only	Code only	Code only	Code only	Docker, DNS, VMs
Drift detection	None	None	None	None	8 checkers	None	/vault-update gate
Self-hosted	Files	Files	Files	Files	Files	Node.js	Milvus, Neo4j + cloud optional
Best for	Quick adoption	Enterprise teams	Solo context eng.	PRD pipelines	Per-repo memory	Structured workflow	Multi-project governance

Multi-Agent Support

SpecFlow works as an MCP server, which means any agent that speaks the MCP protocol can use it. Verified with all three major coding agents:

Agent	MCP Loading	Spec Lifecycle	Skills
Claude Code	MCP via npm, plugin via git clone	Full	60+ skills via SKILL.md
Gemini CLI	Manual MCP config	Full	Via GEMINI.md instructions
Codex CLI	Manual MCP config	Full	Via CODEX.md instructions

All three agents share the same MCP tools, DocVault knowledge base, and spec workflow. Agent-specific instruction files (CLAUDE.md, GEMINI.md, CODEX.md) tailor behavior to each agent's capabilities.

Cross-Agent Spec Handoff

Spec state lives on disk in DocVault (DocVault/specflow/{project}/specs/) — not in any agent's memory. Each project has a thin .specflow/config.json pointer. This means you can start a spec in one agent and continue in another:

Claude Code                    Codex CLI                      Gemini CLI
────────────                   ─────────                      ──────────
/issue create                  @spec resume                   @spec resume
  → creates SWF-65               → reads spec from disk         → reads spec from disk
/discover SWF-65                 → runs Phase 3 (Tasks)          → runs Phase 4 (Implement)
  → discovery brief               → generates task list            → implements tasks
/spec SWF-65                     → writes tasks.md                → commits code
  → Phase 1 (Requirements)      → awaits approval                → logs implementation
  → Phase 2 (Design)
  → awaits approval

Each agent reads the current spec state, advances the workflow, and writes the result back to disk. The dashboard shows progress regardless of which agent is driving. Use whichever agent is best suited for each phase — Claude for discovery and design, Codex for implementation, Gemini for review.

Quick Start

Claude Code

Step 1 — Install the MCP server (provides tools and prompts):

Add to your user-level settings (~/.claude/settings.json):

{
  "mcpServers": {
    "specflow": {
      "command": "npx",
      "args": ["-y", "@lbruton/specflow@latest", "."]
    }
  }
}

Step 2 — Install the plugin (optional — provides skills and slash commands):

git clone https://github.com/lbruton/specflow.git
cp -r specflow/plugin/ ~/.claude/plugins/marketplaces/specflow-marketplace/

Skills like /prime, /wrap, /spec are available immediately after restarting Claude Code.

Note: The plugin provides skills and commands only — it does not start the MCP server. The MCP server must be installed separately via Step 1.

Gemini CLI — Manual Install

git clone https://github.com/lbruton/specflow.git
cd specflow
npm install && npm run build

Add to your Gemini MCP config (~/.gemini/settings.json or project-level):

{
  "mcpServers": {
    "spec-workflow": {
      "command": "node",
      "args": ["/path/to/specflow/dist/index.js", "/path/to/your/project"]
    }
  }
}

Copy GEMINI.md from the specflow repo root into your project root for agent-specific instructions.

Codex CLI — Manual Install

git clone https://github.com/lbruton/specflow.git
cd specflow
npm install && npm run build

Add to your Codex MCP config (.codex/config.toml or user-level):

[mcp.specflow]
command = "node"
args = ["/path/to/specflow/dist/index.js", "/path/to/your/project"]

Copy CODEX.md from the specflow repo root into your project root for agent-specific instructions.

Any MCP-Compatible Agent — Via npx

{
  "mcpServers": {
    "spec-workflow": {
      "command": "npx",
      "args": ["-y", "@lbruton/specflow@latest", "/path/to/your/project"]
    }
  }
}

Dashboard

Real-time web UI for spec tracking, approvals, and implementation logs. Port 5051 by default.

npx @lbruton/specflow@latest --dashboard --port 5051

MCP Tools

Tool	Description
`spec-status`	Get detailed status of a spec
`spec-list`	List all specs across projects
`approvals`	Manage phase approval workflow
`log-implementation`	Record implementation artifacts
`spec-workflow-guide`	Get workflow guidance
`steering-guide`	Access project steering documents

MCP Prompts

Prompt	Description
`create-spec`	Create a new spec from requirements
`implement-task`	Generate implementation plan for a task
`create-steering-doc`	Create project steering documentation
`refresh-tasks`	Re-sync task state from spec files
`wrap`	End-of-session orchestrator (cleanup, documentation, retro, digest)
`prime`	Fast session quick-start (~15s) with optional deep mode
`audit`	On-demand project health check (code, security, drift, issues)
+ 3 injection prompts	Context injection for guides

v3.1.0 note: /wrap replaces the standalone /goodnight and /digest-session skills, which are now deprecated.

Prerequisites

The core spec workflow works out of the box with Node.js. Extended features use additional services -- some self-hosted, some cloud-based. Local LLM support exists via Ollama but results vary significantly by model size and hardware (a capable GPU is recommended; smaller models may produce lower-quality output):

Component	Purpose	Link
Obsidian	DocVault knowledge base	obsidian.md
mem0	Cross-session episodic memory (cloud API; self-hosted fork planned)	mem0.ai
Milvus	Self-hosted vector DB for Code Context	milvus.io
Claude Code	CLI agent that consumes MCP servers	docs

Architecture

src/
  tools/               # MCP tool definitions (6 tools)
  prompts/             # MCP prompt definitions (10 prompts)
  core/                # Shared logic (parser, task-parser, path-utils)
  dashboard/           # Dashboard backend (multi-server, approval-storage)
  dashboard_frontend/  # React 18 frontend (Vite + Tailwind)
  markdown/            # Document and review templates
  types.ts             # Shared TypeScript types
  index.ts             # CLI entry point

Roadmap

Development

npm install        # Install dependencies
npm run build      # Compile TypeScript + build dashboard frontend
npm run dev        # Development mode with hot reload

Upstream Documentation

Core functionality docs from Pimzino's project:

Credits

Pimzino/spec-workflow-mcp is the foundation. Pimzino designed and built the core architecture: the MCP server, sequential spec workflow, real-time dashboard with blocking approval gates, approval storage, markdown parser, implementation logging, template engine, multi-language support, VSCode extension, Docker deployment, and security hardening. SpecFlow adds workflow extensions, knowledge architecture, and code intelligence on top of that substantial foundation.

theDakshJaitly/mex inspired several planned features: documentation drift detection with a scoring system, deterministic pattern files promoted from session learnings, and post-commit staleness hooks. mex's per-repo memory scaffold and GROW learning loop showed what disciplined context engineering looks like -- SpecFlow's multi-project approach builds on those ideas.

Zilliz/claude-context provides the semantic code search engine that Code Context is forked from.

License

GPL-3.0 -- same as upstream.

Name		Name	Last commit message	Last commit date
Latest commit History 509 Commits
.claude-plugin		.claude-plugin
.github		.github
assets		assets
commands		commands
containers		containers
docs		docs
e2e		e2e
launch		launch
plugin		plugin
scripts		scripts
skills		skills
src		src
vscode-extension		vscode-extension
.DS_Store		.DS_Store
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
GEMINI.md		GEMINI.md
LICENSE		LICENSE
README.ar.md		README.ar.md
README.de.md		README.de.md
README.es.md		README.es.md
README.fr.md		README.fr.md
README.it.md		README.it.md
README.ja.md		README.ja.md
README.ko.md		README.ko.md
README.md		README.md
README.pt.md		README.pt.md
README.ru.md		README.ru.md
README.zh.md		README.zh.md
package-lock.json		package-lock.json
package.json		package.json
playwright.config.ts		playwright.config.ts
playwright.worktree.config.ts		playwright.worktree.config.ts
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Four Systems, One Workflow

Three-Tier Memory Architecture

Continuous Learning Loop

Spec Workflow Lifecycle

Parallel Subagent Dispatch

Code Intelligence

Comparison

Multi-Agent Support

Cross-Agent Spec Handoff

Quick Start

Claude Code

Gemini CLI — Manual Install

Codex CLI — Manual Install

Any MCP-Compatible Agent — Via npx

Dashboard

MCP Tools

MCP Prompts

Prerequisites

Architecture

Roadmap

Development

Upstream Documentation

Credits

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Four Systems, One Workflow

Three-Tier Memory Architecture

Continuous Learning Loop

Spec Workflow Lifecycle

Parallel Subagent Dispatch

Code Intelligence

Comparison

Multi-Agent Support

Cross-Agent Spec Handoff

Quick Start

Claude Code

Gemini CLI — Manual Install

Codex CLI — Manual Install

Any MCP-Compatible Agent — Via npx

Dashboard

MCP Tools

MCP Prompts

Prerequisites

Architecture

Roadmap

Development

Upstream Documentation

Credits

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages