Deterministic scoring code for the Brain-Wrought personal-brain benchmark.
All scoring math. No LLM-in-the-loop except where unavoidable (judge panel for Axis C). Everything else is deterministic Python with seeded randomness.
- `retrieval/` — P@k, Recall@k, MRR, nDCG@k, personalization weighting, temporal qrel evaluation, abstention scoring
- `ingestion/` — entity recall, backlink F1, citation accuracy, schema completeness, setup friction
- `assistant/` — judge panel orchestration via LiteLLM (Sonnet 4.6 + Opus 4.7 + GPT-5.4), bootstrap confidence intervals
- `fixtures/` — seeded fixture generation, randomization
- `leaderboard/` — composite score aggregation
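The repo's own implementations are the source of truth; as a rough illustration, the simpler retrieval metrics reduce to plain set arithmetic over a ranked list. A minimal sketch (function bodies here are assumptions for illustration, not the repo's code):

```python
from typing import Sequence, Set


def precision_at_k(relevant: Set[str], retrieved: Sequence[str], k: int) -> float:
    """Fully deterministic — bit-identical output for the same input.

    Fraction of the top-k slots occupied by relevant documents.
    """
    hits = sum(1 for doc_id in retrieved[:k] if doc_id in relevant)
    return hits / k


def mrr(relevant: Set[str], retrieved: Sequence[str]) -> float:
    """Fully deterministic — reciprocal rank of the first relevant hit."""
    for rank, doc_id in enumerate(retrieved, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0
```

Note that dividing by `k` (rather than by the number of retrieved items) penalizes under-retrieval, which is the conventional P@k definition.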
- No CLI (that's `brain-wrought-harness`)
- No markdown skills or docs (those are `brain-wrought-skills`)
- No qrels, gold graphs, or actual judge rubrics (sealed private repo, fetched at eval time via CI)
Every function is classified in its docstring as one of:
- Fully deterministic — bit-identical output for the same input (IEEE 754 caveats)
- Seeded-stochastic — identical output given the same seed
- Bounded-stochastic — reruns fall within declared confidence interval
CI enforces these claims.
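The seeded-stochastic class is the interesting one in practice: the bootstrap confidence intervals for Axis C must reproduce exactly under the same seed while still being honest resamples. A minimal sketch of that pattern, assuming a hypothetical `bootstrap_ci` helper (the name and signature are illustrative, not the repo's API):

```python
import random
import statistics
from typing import Sequence


def bootstrap_ci(
    scores: Sequence[float], seed: int, n_resamples: int = 1000, alpha: float = 0.05
) -> tuple[float, float]:
    """Seeded-stochastic — identical output given the same seed."""
    rng = random.Random(seed)  # local RNG; global random state is banned
    means = sorted(
        statistics.fmean(rng.choices(scores, k=len(scores)))
        for _ in range(n_resamples)
    )
    # Percentile bounds of the resampled means.
    lo = means[int((alpha / 2) * n_resamples)]
    hi = means[int((1 - alpha / 2) * n_resamples) - 1]
    return lo, hi
```

Because the RNG is constructed from the seed inside the function, two calls with identical arguments are bit-identical, which is what a CI reproducibility check can assert.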
- Python 3.12.3 (pinned)
- Pydantic v2 for all data contracts
- pytest + pytest-randomly + hypothesis for tests
- mypy strict
- ruff format + lint (line length 100)
- 100% coverage on scoring modules
- No use of global random state (`random.random()` is banned; use `random.Random(seed)`)
- No direct LLM SDK calls — always via LiteLLM
- Every function crossing a module boundary has a Pydantic contract on I/O
- No side effects in scoring functions (pure functions; any logging happens in the caller)
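Taken together, the last two rules mean a boundary-crossing scoring function looks roughly like the following sketch. The model and function names here are hypothetical, not the repo's actual contracts:

```python
from pydantic import BaseModel, Field


class RetrievalScoreRequest(BaseModel):
    """Hypothetical input contract for a boundary-crossing scorer."""

    relevant: set[str]
    retrieved: list[str]
    k: int = Field(gt=0)


class RetrievalScoreResult(BaseModel):
    """Hypothetical output contract; bounds are validated by Pydantic."""

    precision: float = Field(ge=0.0, le=1.0)


def score_retrieval(req: RetrievalScoreRequest) -> RetrievalScoreResult:
    """Fully deterministic — pure function: no logging, no I/O, no mutation."""
    hits = sum(1 for doc_id in req.retrieved[: req.k] if doc_id in req.relevant)
    return RetrievalScoreResult(precision=hits / req.k)
```

Keeping the function pure makes the determinism classes checkable: CI can rerun it on fixed fixtures and compare outputs byte-for-byte, while any logging stays in the caller.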
See CLAUDE.md for the full coding standard.
Install with `pip install brain-wrought-engine`:

```python
from brain_wrought_engine.retrieval import precision_at_k, ndcg_at_k

p10 = precision_at_k(relevant={"a", "b", "c"}, retrieved=["a", "x", "b", "y", "c"], k=10)
```

MIT.