
feat(anthropic): add dedicated Claude-compatible endpoint layer #97

Open
dwnmf wants to merge 16 commits into Soju06:main from dwnmf:chore/uncommitted-snapshot-20260222

Conversation

dwnmf (Contributor) commented Feb 22, 2026

This PR adds a dedicated Anthropic/Claude-compatible API surface on top of the existing OpenAI-compatible backend.

What it does: Claude Code clients expect a different contract than OpenAI-compatible ones. Rather than pushing that complexity onto clients, this adds a translation layer that keeps all the mapping logic in one place — new endpoints, format translation, stream adaptation, the whole thing.

New endpoints:

  • POST /v1/messages and POST /anthropic/v1/messages
  • POST /v1/messages/count_tokens and POST /anthropic/v1/messages/count_tokens

What's in the module: Request/response schemas for messages and tool blocks, a translator from Anthropic message format to Responses payload, stream adaptation back to Anthropic event format, and Claude model aliases that normalize to the configured upstream lane. Claude requests get forced to gpt-5.3-codex with xhigh reasoning — conflicting sampling fields are stripped automatically.
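As a rough illustration of the translation step described above, here is a minimal sketch that flattens Anthropic-style messages into OpenAI-style chat messages. It omits tool blocks and the actual Responses payload shape, and the function name is hypothetical, not the module's real API:

```python
def anthropic_to_openai_messages(payload: dict) -> list[dict]:
    """Flatten an Anthropic /v1/messages payload into OpenAI-style chat messages.

    Illustrative only: the real translator targets the Responses payload and
    also handles tool-use/tool-result blocks, which this sketch skips.
    """
    messages: list[dict] = []
    # Anthropic carries the system prompt as a top-level field, not a message.
    if system := payload.get("system"):
        messages.append({"role": "system", "content": system})
    for msg in payload.get("messages", []):
        content = msg["content"]
        # Anthropic content may be a list of typed blocks; keep only text blocks.
        if isinstance(content, list):
            content = "".join(
                block.get("text", "")
                for block in content
                if block.get("type") == "text"
            )
        messages.append({"role": msg["role"], "content": content})
    return messages
```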

Prompt caching: Cache key extraction for Anthropic payloads, a dedicated lane for count-tokens flows, and safer handling of metadata/cache-control fields that were previously causing upstream 400 errors.

Observability: Request logs now include hashed Codex session and conversation IDs. Test coverage extended for migrations, request logs, translator behavior, and Anthropic compatibility.

Known issue — cache hit rate is intermittent. Not guaranteed per request. Example from local logs:

14:11:25 -> input_tokens=15532, cached_input_tokens=0
14:11:32 -> input_tokens=15603, cached_input_tokens=15104
14:11:43 -> input_tokens=15629, cached_input_tokens=0

chatgpt-codex-connector (bot) left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 5c79393a2e


Comment on lines 58 to 59
if path.startswith("/v1/") or path.startswith("/backend-api/"):
    return "openai"


P2: Route /v1/messages validation failures to the Anthropic envelope

The fallback format detection still treats every /v1/* path as OpenAI, so requests like malformed JSON to /v1/messages (or /v1/messages/count_tokens) can bypass set_anthropic_error_format and return the OpenAI error shape instead of Anthropic's {"type":"error",...} envelope. Because JSON/body parsing errors are raised before route dependencies run, Anthropic clients can receive an incompatible error contract on exactly the endpoints this change adds.
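A minimal sketch of the fix this finding suggests — checking Anthropic-specific prefixes before the generic /v1/ fallback. The function name and return values are illustrative, not the project's actual API:

```python
# Paths served by the Anthropic-compatible layer must be classified before
# the generic /v1/* bucket, or they inherit the OpenAI error envelope.
ANTHROPIC_PREFIXES = ("/v1/messages", "/anthropic/")

def detect_error_format(path: str) -> str:
    """Classify a request path so even pre-route JSON/body parsing errors
    can be wrapped in the right error envelope. Sketch only."""
    if any(path.startswith(prefix) for prefix in ANTHROPIC_PREFIXES):
        return "anthropic"
    if path.startswith("/v1/") or path.startswith("/backend-api/"):
        return "openai"
    return "unknown"
```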


dwnmf and others added 11 commits February 22, 2026 15:52
…ases

- Return stop_reason="stop_sequence" when upstream response includes a
  stop_sequence value, instead of incorrectly falling through to
  "end_turn" (affects both streaming and non-streaming paths)
- Skip camelCase cache key aliases (promptCacheKey, promptCacheRetention)
  in outbound payloads when the snake_case canonical form already exists,
  preventing duplicate fields that can cause upstream validation errors
Define Phase 5 re-review loop: after fixes are committed, re-run Codex
review until 0 findings. Safety limits: max 3 iterations, recurring
findings marked wont_fix, user can stop early via HITL override.
- Add x-api-key to IGNORE_INBOUND_HEADERS to prevent client credentials
  from leaking to upstream OpenAI requests via the Anthropic endpoint
- Validate model access and enforce limits against the original requested
  model name (payload.model) instead of the remapped upstream model,
  so API keys restricted to Claude model names work correctly
Write passthrough cache keys using their canonical snake_case form
(prompt_cache_key, prompt_cache_retention) instead of preserving the
original camelCase input, so upstream fields are consistently named.
Prevent _merge_passthrough_cache_extras from re-adding raw model_extra
values for keys already handled by the normalization pipeline
(prompt_cache_key, prompt_cache_retention). This avoids forwarding
malformed values (e.g. whitespace-only) that normalization rejected.
Read promptCacheRetention (camelCase) as fallback in
_extract_prompt_cache_retention, matching the pattern used by
_extract_explicit_prompt_cache_key. Prevents silent loss of
retention config when clients use camelCase field names.
info.md Outdated
Owner


Please either pass the document through OpenSpec or remove it.

Contributor Author


Done


logger = logging.getLogger(__name__)

_CLAUDE_CODE_FORCED_MODEL = "gpt-5.3-codex"
Owner


Aliasing a model like claude-* to gpt-5.3-codex is probably not a good idea.
Following the approach in https://code.claude.com/docs/en/llm-gateway, modify and use Claude Code's environment variables instead.
codex-lb should only provide a transport layer.

Contributor Author


Sure. It was an approach from the development phase, when I was trying to eliminate cache issues.

Owner


Service/dependency layer architecture violation

dwnmf (Contributor, Author) commented Feb 26, 2026

Hi @Soju06, thanks for the review. Changes since the last round:

  • Removed the forced Claude model aliasing and the hardcoded reasoning override, so the Anthropic-compatible layer now stays transport-only
  • Moved the API key reservation/release flow out of the route layer into the service layer
  • Removed the out-of-band info.md notes and synced the related updates into OpenSpec
  • Added support for mapping reasoningEffort / reasoning.effort, including a server default via CODEX_LB_ANTHROPIC_DEFAULT_REASONING_EFFORT
  • Updated and re-ran the relevant tests

One caveat: prompt caching is still not working correctly right now.


Labels

enhancement New feature or request
