Git-native, review-gated knowledge base for LLM agents. MCP server + JSONL tool server + CLI.
Agents should not start every session with amnesia — but they shouldn't get to write whatever they want either.
vouch is a knowledge base for LLM agents with an explicit review gate: agents propose writes; humans approve them with vouch approve. Approved artifacts are plain files on disk — YAML for claims, markdown for pages — so the KB lives in your repo, is reviewed in PRs, diffs cleanly, and can be exported as a portable bundle.
Still alpha — surface is small on purpose; expect breaking changes pre-1.0.
Three opinionated choices distinguish vouch from the neighbours:
- The KB is a folder in your repo. Git is your audit log, your backup, and your sync mechanism. PRs are your review UI.
- Writes require approval. Agents file proposals; a human (or trusted approving agent) explicitly accepts them.
proposed/is gitignored, so rejected drafts never pollute history. - Claims must cite sources. A claim without at least one Source / Evidence id is a validation error, not a warning. Sources are content-hashed; the same evidence registered twice de-duplicates.
Worth it when:
- Multiple agents share a repo (Claude Code + Cursor + a CI bot). Per-agent attribution in the audit log makes "which agent claimed what" answerable.
- Sessions keep re-explaining the same context. Curated, cited claims let new sessions start from your team's agreed answer instead of re-grepping.
- You want decision records without the ADR ceremony.
vouch crystallizepromotes a session's durable parts into proposals; one approve and they're permanent. - You'd review agent-stated facts the way you review agent-written code. Vouch is a PR queue for claimed knowledge.
- Compliance / audit trails matter. Required citations + append-only audit log give you "who decided X, citing what, when" for free.
Skip it if:
- Solo hobby project where context fits in your head.
- Short-lived branches with no compounding context.
- Your team won't actually review proposals — without the gate, vouch is a worse note app.
# from PyPI (published as vouch-kb; the command is still `vouch`)
pipx install vouch-kb
# …or from the cloned repo, in a venv
pip install -e '.[dev]'# 1. set up a KB at your project root
vouch init
# 2. as agent, register evidence + propose claims (via MCP/JSONL server)
vouch serve # MCP over stdio (Claude Code, Cursor, Codex)
vouch serve --transport jsonl # newline-delimited JSON over stdin/stdout
# 3. as human, review and decide
vouch status # one-line summary
vouch pending # list pending proposals
vouch show <id> # full details
vouch approve <id> # → durable artifact
vouch reject <id> --reason "..."
# 4. commit
git add .vouch/ && git commit -m "kb: approve auth-uses-jwt"vouch init now creates a starter config plus one cited example claim, so
you can try the loop before wiring an agent:
mkdir vouch-demo && cd vouch-demo
vouch init
vouch status
vouch search agent
vouch cite vouch-starter-reviewed-knowledgeThe starter claim is already durable and cites the starter source. Replace it with your project's first real source and claim when you are ready.
The full captured walkthrough lives at docs/example-session.md; re-render the GIF from docs/demo.tape with vhs docs/demo.tape.
Source immutable input material (file, URL, transcript, commit, …)
content-addressed by sha256
Evidence span pointer into a Source (line range, timestamp, quote)
Claim atomic durable assertion
type: fact | decision | preference | workflow | observation | question | warning
status: working | actionable | stable | contested | superseded | archived | redacted
confidence: 0.0–1.0
cites: list of Source / Evidence ids
Entity typed thing (person | project | repo | concept | decision | system | …)
Relation typed edge (uses | depends_on | supersedes | contradicts | implements | …)
Page maintained markdown — entity write-up, decision record, session summary
Session work block opened by an agent; bundles its proposals
AuditEvent append-only log record for every mutation
After vouch init, your repo contains:
.vouch/
├── config.yaml # KB settings
├── .gitignore # ignores proposed/, state.db
├── audit.log.jsonl # append-only audit (committed)
├── state.db # SQLite FTS5 index (derived; not committed)
├── claims/<id>.yaml # reviewed claims (init seeds vouch-starter-*.yaml)
├── pages/<id>.md # markdown pages with YAML frontmatter
├── sources/<sha>/{meta.yaml,content}
├── entities/<id>.yaml # graph nodes
├── relations/<id>.yaml # graph edges
├── evidence/<id>.yaml # span pointers into sources
├── sessions/<id>.yaml # agent session records
├── proposed/<id>.yaml # pending (gitignored, local-only)
└── decided/<id>.yaml # approved/rejected (committed for audit)
The files are the source of truth; state.db is a derivable cache (vouch index rebuilds it).
vouch init # set up .vouch/ at PATH
vouch discover [--path P] # find the nearest .vouch/ root
vouch capabilities # emit the JSON capabilities descriptor
vouch status [--json] # KB counts + pending proposals
vouch lint [--stale-days N] # user-actionable problems
vouch doctor # full sweep incl. source verification
vouch pending # list pending proposals
vouch show <proposal-id>
vouch approve <proposal-id> [--reason ...]
vouch reject <proposal-id> --reason "..."
vouch propose-claim --text ... --source ... [--type ...] [--confidence X]
vouch propose-page --title ... [--body -] [--claim ID ...]
vouch propose-entity --name ... --type ...
vouch propose-relation --from ID --rel uses --to ID
vouch source add PATH [--title ...] [--url ...]
vouch source verify [--fail-on-issue]
vouch supersede OLD_ID NEW_ID
vouch contradict CLAIM_A CLAIM_B
vouch archive CLAIM_ID
vouch confirm CLAIM_ID
vouch cite CLAIM_ID
vouch session start [--task ...] [--note ...]
vouch session end SESSION_ID
vouch crystallize SESSION_ID [--no-page]
vouch search QUERY [--limit N]
vouch context TASK [--max-chars N] [--min-items N] [--require-citations]
vouch index
vouch audit [--tail N] [--json]
vouch export --out path.tar.gz
vouch export-check path.tar.gz
vouch import-check path.tar.gz
vouch import-apply path.tar.gz [--on-conflict skip|overwrite|fail]
vouch serve [--transport stdio|jsonl]
Read (unrestricted): kb.capabilities, kb.status, kb.search, kb.context, kb.read_{page,claim,entity,relation}, kb.list_{pages,claims,entities,relations,sources,pending}
Source intake (not gated — evidence is harmless and de-duplicates): kb.register_source, kb.register_source_from_path, kb.source_verify
Write (gated → produce proposals): kb.propose_{claim,page,entity,relation} (with dry_run:true for preview-only)
Decisions: kb.approve, kb.reject (host trust required)
Lifecycle (metadata about reviewed knowledge — direct mutation, audited): kb.supersede, kb.contradict, kb.archive, kb.confirm, kb.cite
Sessions: kb.session_start, kb.session_end, kb.crystallize
Maintenance: kb.index_rebuild, kb.lint, kb.doctor, kb.audit, kb.export, kb.export_check, kb.import_check, kb.import_apply
In your project's .mcp.json:
{
"mcpServers": {
"vouch": {
"command": "vouch",
"args": ["serve"],
"env": { "VOUCH_AGENT": "claude-code" }
}
}
}VOUCH_AGENT is recorded as proposed_by and as the actor on every audit event, so multi-agent setups can attribute writes correctly.
The JSONL transport reads one envelope per line on stdin, writes one per line on stdout:
{"id":"r1","method":"kb.search","params":{"query":"jwt","limit":5}}
{"id":"r1","ok":true,"result":[{"kind":"claim","id":"auth-uses-jwt","snippet":"…","score":1.2,"backend":"fts5"}]}Errors come back with ok:false and a structured error.code (method_not_found, missing_param, invalid_request, internal_error).
vouch export --out kb.tar.gz # tar.gz + manifest.json with per-file sha256
vouch export-check kb.tar.gz # verify every file against the manifest
vouch import-check kb.tar.gz # diff against destination — new / conflict / identical
vouch import-apply kb.tar.gz --on-conflict skip # apply (default skip; never destructive without overwrite)proposed/, state.db, and audit.log.jsonl are excluded from bundles — only committable artifacts travel.
| mem0 / Letta | LLM-Wiki tools | vouch | |
|---|---|---|---|
| Knowledge lives in | a service | filesystem | your repo |
| Review of writes | none | none | explicit approve |
| Evidence required | no | optional | enforced |
| Per-agent attribution | partial | none | yes (audit log) |
| Graph (entities + relations) | no | no | yes |
| FTS5 search | no | varies | yes |
| Portable bundle | no | no | yes |
| Transports | SDK / HTTP | none | MCP + JSONL |
| Area | Current support |
|---|---|
| Knowledge base | .vouch/ folder, YAML claims/entities/relations/evidence/sessions, markdown pages with frontmatter, JSONL audit log, content-addressed sources |
| CLI | init, discover, capabilities, status, lint, doctor, pending, show, approve, reject, propose-{claim,page,entity,relation}, source add, source verify, supersede, contradict, archive, confirm, cite, session {start,end}, crystallize, search, context, index, audit, export, export-check, import-check, import-apply, serve |
| Tool servers | MCP over stdio + JSONL over stdin/stdout, same kb.* surface across both transports, capabilities + knowledge-capability descriptor |
| Schemas | 13 JSON Schemas (Draft 2020-12) generated from pydantic in schemas/, plus hand-maintained bundle.manifest and jsonl-envelope schemas |
| Write safety | review-gated writes via proposed/, dry_run:true previews, host trust required for approve/reject, atomic exclusive-create storage, path-traversal blocked on source intake and bundle import |
| Retrieval | SQLite FTS5 + substring fallback; optional semantic backends (all-mpnet-base-v2, MiniLM-L6, fastembed-BGE) behind install extras; context packs with citations + quality gate |
| Lifecycle | supersede, contradict, archive, confirm, cite — direct mutations, all audited |
| Portability | tar.gz bundles with per-file sha256 manifest.json, export-check, import-check, import-apply with skip/overwrite/fail conflict modes |
| Audit | append-only audit.log.jsonl, per-event actor (VOUCH_AGENT), object ids, dry-run flag, reversible flag |
| Adapters | Claude Code wiring documented via .mcp.json + VOUCH_AGENT env; per-runtime adapter templates not yet shipped |
| Validation | pytest suite (storage, FTS5, audit, source-verify, review-gate, bundle, JSONL), ruff + mypy gates, GitHub CI |
| Specification | dated snapshots under spec/, JSON Schemas in schemas/, generator script at scripts/gen_schemas.py |
Pre-1.0. What's not in this implementation: vector embeddings (BM25/FTS5 only), per-runtime adapter templates, benchmark fixtures, multi-agent sync, scopes beyond a single field on Claim/Source. If a hole matters to you, file an issue.
MIT.
