Add context_window override so the prompt %/compaction follow it live by chauncygu · Pull Request #137 · SafeRL-Lab/cheetahclaws

chauncygu · 2026-06-04T22:32:58Z

The prompt's context-usage % and the compaction trigger both derive from the model's context window, which previously could only come from a hardcoded provider default. Setting max_tokens (the OUTPUT cap) does not change it. This PR adds a dedicated, session-settable override.

- cc_config.py: new DEFAULTS key context_window (0 = use the model default).
- providers.context_window_override(config): single source that parses the override (a positive int wins; 0/unset/negative/non-numeric/bool -> 0).
- compaction.get_context_limit(): honors the override and keeps it distinct from max_tokens. The override is bidirectional: a smaller value forces earlier compaction, a larger value corrects a stale default.
- providers.py send paths: apply the same override to the per-call output-cap window (dynamic_cap_max_tokens) so the %, compaction, and the cap stay consistent. No-op when unset, so existing behavior is unchanged.
- /config context_window=<N>: warns when N exceeds the model's real window (this disables compaction, so the API may reject oversized prompts) and notes that the change takes effect on the next prompt (no restart).
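The parsing and fallback rules listed above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code from this PR: the `MODEL_DEFAULT_WINDOWS` table, the `FALLBACK_WINDOW` value, the `"model"` config key, and the choice to accept numeric strings are all hypothetical. Only the function names and the "positive int wins, everything else -> 0" rule come from the description.

```python
from typing import Any, Mapping

# Hypothetical stand-in for the provider's hardcoded per-model defaults.
MODEL_DEFAULT_WINDOWS = {"example-model": 200_000}
FALLBACK_WINDOW = 128_000  # assumed fallback, not taken from the PR


def context_window_override(config: Mapping[str, Any]) -> int:
    """Return a positive override, or 0 when the setting is unusable."""
    raw = config.get("context_window", 0)
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(raw, bool):
        return 0
    try:
        value = int(raw)
    except (TypeError, ValueError, OverflowError):
        return 0  # unset (None), non-numeric, or non-finite input
    return value if value > 0 else 0


def get_context_limit(config: Mapping[str, Any]) -> int:
    """Prefer the override; otherwise use the model default.

    max_tokens is deliberately never consulted here: it caps output,
    it does not describe the context window.
    """
    override = context_window_override(config)
    if override:
        return override
    return MODEL_DEFAULT_WINDOWS.get(config.get("model", ""), FALLBACK_WINDOW)
```

With this shape, an override of 50_000 on a 200_000-token model triggers compaction earlier, while an override of 1_000_000 raises the limit past a stale default; in both cases a `max_tokens` entry in the same config has no effect on the result.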

The override is read live on each prompt, so changing the model or context_window updates the % without a restart. Tests cover parsing, overrides in both directions, and that max_tokens is never read as the context window.
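To illustrate what "read live" means in practice, here is a minimal sketch in which the percentage is recomputed from the current config on every call rather than cached at startup. The helper `context_usage_percent`, its `model_default` parameter, and the simplified inline parsing are assumptions made for this example; the real prompt code may be structured differently.

```python
def context_usage_percent(used_tokens: int, config: dict, model_default: int) -> float:
    """Recompute the usage % from the config as it is right now."""
    raw = config.get("context_window", 0)
    # Simplified parsing for the sketch: only a positive, non-bool int counts.
    is_override = isinstance(raw, int) and not isinstance(raw, bool) and raw > 0
    limit = raw if is_override else model_default
    return round(100.0 * used_tokens / limit, 1)


config = {"context_window": 0, "max_tokens": 8192}

# No override set: the model default (assumed 200_000 here) is the limit.
before = context_usage_percent(50_000, config, model_default=200_000)

# Simulate a mid-session /config change; no restart, same config object.
config["context_window"] = 100_000
after = context_usage_percent(50_000, config, model_default=200_000)

# Changing the output cap must not move the percentage.
config["max_tokens"] = 1024
unchanged = context_usage_percent(50_000, config, model_default=200_000)

print(before, after, unchanged)  # 25.0 50.0 50.0
```

Because nothing is cached between calls, the next prompt after a config change already reflects the new window.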

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

chauncygu merged commit eeaa2e9 into main Jun 4, 2026
6 checks passed

chauncygu merged 1 commit into main from context-window-config
