[codex] Fix NVIDIA hosted model slugs by ChunkyMonkey11 · Pull Request #4 · ChunkyMonkey11/Claudia-Router

ChunkyMonkey11 · 2026-06-03T00:24:26Z

Summary

Updates the NVIDIA fast and fallback profiles to use the hosted Build/API model slugs that work on https://integrate.api.nvidia.com/v1.

Root cause

npm run claude:fast was routing claude-3-5-sonnet-latest to the Hugging Face-style BF16 model ID nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16. NVIDIA's hosted endpoint returned 404 page not found for that model ID. Direct provider probes confirmed the hosted slug nvidia/nemotron-3-super-120b-a12b succeeds on the same endpoint and account.

The Nano fallback had the same stale-slug pattern, so it was updated to nvidia/nemotron-3-nano-30b-a3b.

Changes

Updates generated NVIDIA provider defaults and profiles.
Updates docs and wrapper model help text.
Updates release smoke and unit test assertions.
Adds NVIDIA extraBody defaults for fast/fallback profiles to disable reasoning traces in normal Claude responses.

Validation

npm run typecheck
npm test
npm run status
npm run claude:fast -- -p "Reply with exactly: router-smoke" returned router-smoke

Fix NVIDIA hosted model slugs

244a0dd

ChunkyMonkey11 marked this pull request as ready for review June 3, 2026 01:00

ChunkyMonkey11 merged commit 6cf6912 into main Jun 3, 2026
2 of 3 checks passed

ChunkyMonkey11 deleted the codex/fix-nvidia-hosted-model-slugs branch June 3, 2026 01:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[codex] Fix NVIDIA hosted model slugs#4

[codex] Fix NVIDIA hosted model slugs#4
ChunkyMonkey11 merged 1 commit into
mainfrom
codex/fix-nvidia-hosted-model-slugs

ChunkyMonkey11 commented Jun 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ChunkyMonkey11 commented Jun 3, 2026

Summary

Root cause

Changes

Validation

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant