fix(eval-only): call configure_qwen_chat so local LLM endpoints can be used by imrehg · Pull Request #85 · microsoft/SkillOpt

imrehg · 2026-06-24T07:33:01Z

The eval-only tool skipped configuring some of the backend types, that the training did configure. Because of this, the eval is silently fell back to a local endpoint that wasn't actually configured, and all evaluations runs failed.

Replicate the backend setup based on the trainer's code, and eval-only can run with the qwen_chat backend.

Tested it briefly with a skill/local qwen_chat configuration that correctly run a training cycle.

Closes #84

…n be used The eval-only tool skipped configuring some of the backend types, that the training did configure. Because of this, the eval is silently fell back to a local endpoint that wasn't actually configured, and all evaluations runs failed. Replicate the backend setup based on the trainer's code, and eval-only can run with the qwen_chat backends. Co-authored-by: Qwen-Coder <noreply@qwen.ai>

imrehg changed the title ~~fix(eval-only): call configure_{qwen,minimax}_chat so local LLM endpoints can be used~~ fix(eval-only): call configure_qwen_chat so local LLM endpoints can be used Jun 24, 2026

Tanmay9223 mentioned this pull request Jun 24, 2026

feat(eval): add missing minimax backend configuration #86

Open

Yif-Yang merged commit 99ccb93 into microsoft:main Jun 25, 2026
1 check passed

Yif-Yang mentioned this pull request Jun 25, 2026

test(sleep): add verifier-discipline stress test (closes #67) #87

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(eval-only): call configure_qwen_chat so local LLM endpoints can be used#85

fix(eval-only): call configure_qwen_chat so local LLM endpoints can be used#85
Yif-Yang merged 1 commit into
microsoft:mainfrom
imrehg:eval-with-qwen-minimax

imrehg commented Jun 24, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

imrehg commented Jun 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

imrehg commented Jun 24, 2026 •

edited

Loading