fix(devserver): honor evaluator.project_id when request omits it by willfrey · Pull Request #372 · braintrustdata/braintrust-sdk-python

Will Frey (willfrey) · 2026-04-29T18:40:49Z

Summary

The dev-server's run_eval builds EvalAsync(...) kwargs with:

{**eval_kwargs, ..., "project_id": eval_data.get("project_id")}

The trailing key always wins in dict-spread merging, so a request body that omits project_id silently overrides the registered evaluator's project_id to None. EvalAsync(name=..., project_id=None) then falls back to using the eval name as the project name (per Eval(...) docstring: "If specified, uses the given project ID instead of the evaluator's name to identify the project."), so experiments route into a per-evaluator-name auto-created project instead of the project the evaluator was registered against.

This bites consumers who mount the dev-server behind a custom auth layer and trigger evals from anything other than the Braintrust playground UI: every triggered run lands in a fresh eval-name-keyed project rather than the canonical project the registered Evaluator(project_id=...) named.

Fix

Fall back to evaluator.project_id when the request omits it. An explicit request-level project_id still takes precedence (no behavior change for the playground UI flow).

project_id = eval_data.get("project_id") or evaluator.project_id

Test plan

test_eval_falls_back_to_evaluator_project_id_when_request_omits_it — registers an evaluator with a known project_id, POSTs /eval without project_id, asserts EvalAsync receives the registered id. (Fails on main, passes with this fix.)
test_eval_request_project_id_overrides_evaluator — confirms an explicit request-level project_id still wins.
Full py/src/braintrust/devserver/ test suite green (21 passed, 2 pre-existing skips).
nox -s pylint passes.
Pre-commit hooks (ruff format, ruff check, codespell) pass.

The dev-server's run_eval built EvalAsync(...) kwargs with {**eval_kwargs, ..., "project_id": eval_data.get("project_id")} The trailing key always wins in dict-spread merging, so a request body that omits project_id silently overrode the registered evaluator's project_id to None. EvalAsync(name=..., project_id=None) then fell back to using the eval name as the project name (per Eval(... project_id) docs: "If specified, uses the given project ID instead of the evaluator's name to identify the project."), so experiments routed into a per-evaluator-name auto-created project instead of the project the evaluator was registered against. Use evaluator.project_id as a fallback when the request omits it. An explicit project_id in the request still takes precedence. Tests: - test_eval_falls_back_to_evaluator_project_id_when_request_omits_it — registers an evaluator with a known project_id, posts /eval without project_id, asserts EvalAsync receives the registered id. - test_eval_request_project_id_overrides_evaluator — confirms an explicit request-level project_id still wins.

Will Frey (willfrey) · 2026-04-29T19:08:07Z

Updated to use the cleaner two-arg dict.get() form:

project_id = eval_data.get("project_id", evaluator.project_id)

The previous ... or evaluator.project_id form treated empty string the same as a missing key, which conflated "absent" with "falsy" — surprising for a literal value the request explicitly sent. The two-arg form expresses the intent more directly: fall back only when the key is absent. The validator in schemas.py already rejects non-string project_id values, so None is impossible through this path.

Both tests still pass (the regression test omits the key; the override test passes a non-empty string — neither exercises the empty-string edge case where the two forms diverge).

Abhijeet Prasad (AbhiPrasad) · 2026-05-01T19:27:53Z

gonna push up a small adjustment for this, and we can merge it in. As always, thanks for the PR Will Frey (@willfrey)!

The previous ... or evaluator.project_id form treated empty string the same as a missing key, which conflated "absent" with "falsy" — surprising for a literal value the request explicitly sent. The two-arg form expresses the intent more directly: fall back only when the key is absent. The validator in schemas.py already rejects non-string project_id values, so None is impossible through this path.

I know this is more correct, but our backend implementation actually uses or, so I'm swapping out for this and adding a test for it accordingly. Just feels safer to have this if things change.

Will Frey (willfrey) · 2026-05-01T19:29:13Z

Thank you! Adjust away :) I appreciate your responsiveness!

Includes - #367 - #372 - #344 - #375 - #380 - #381

Abhijeet Prasad (AbhiPrasad) · 2026-05-04T16:47:09Z

released with https://github.com/braintrustdata/braintrust-sdk-python/releases/tag/py-sdk-v0.19.0

Abhijeet Prasad (AbhiPrasad) self-assigned this Apr 29, 2026

Will Frey (willfrey) force-pushed the fix/devserver-honor-evaluator-project-id branch from bff47a1 to 69505ff Compare April 29, 2026 19:07

Merge branch 'main' into fix/devserver-honor-evaluator-project-id

bd57b98

use or to match backend devserver logic

2f81ac1

Abhijeet Prasad (AbhiPrasad) enabled auto-merge (squash) May 1, 2026 19:31

Abhijeet Prasad (AbhiPrasad) approved these changes May 1, 2026

View reviewed changes

Abhijeet Prasad (AbhiPrasad) merged commit 7c04444 into braintrustdata:main May 1, 2026
82 checks passed

Abhijeet Prasad (AbhiPrasad) mentioned this pull request May 4, 2026

chore: Bump to 0.19.0 #390

Merged

Abhijeet Prasad (AbhiPrasad) added a commit that referenced this pull request May 4, 2026

chore: Bump to 0.19.0 (#390)

16440ce

Includes - #367 - #372 - #344 - #375 - #380 - #381

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(devserver): honor evaluator.project_id when request omits it#372

fix(devserver): honor evaluator.project_id when request omits it#372
Abhijeet Prasad (AbhiPrasad) merged 3 commits intobraintrustdata:mainfrom
willfrey:fix/devserver-honor-evaluator-project-id

Will Frey (willfrey) commented Apr 29, 2026

Uh oh!

Will Frey (willfrey) commented Apr 29, 2026

Uh oh!

Abhijeet Prasad (AbhiPrasad) commented May 1, 2026 •

edited

Loading

Uh oh!

Will Frey (willfrey) commented May 1, 2026

Uh oh!

Uh oh!

Abhijeet Prasad (AbhiPrasad) commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Will Frey (willfrey) commented Apr 29, 2026

Summary

Fix

Test plan

Uh oh!

Will Frey (willfrey) commented Apr 29, 2026

Uh oh!

Abhijeet Prasad (AbhiPrasad) commented May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Will Frey (willfrey) commented May 1, 2026

Uh oh!

Uh oh!

Abhijeet Prasad (AbhiPrasad) commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Abhijeet Prasad (AbhiPrasad) commented May 1, 2026 •

edited

Loading