Skip to content

Comments

MAF-19056: chore(deploy): add moreh-vllm preset generator script#65

Merged
hhk7734 merged 2 commits intomainfrom
MAF-19056
Feb 20, 2026
Merged

MAF-19056: chore(deploy): add moreh-vllm preset generator script#65
hhk7734 merged 2 commits intomainfrom
MAF-19056

Conversation

@hhk7734
Copy link
Member

@hhk7734 hhk7734 commented Feb 20, 2026

Add hack/gen_moreh_vllm_presets.py to generate InferenceServiceTemplate Helm preset files from the preset list in a moreh-vllm container image.

Add hack/gen_moreh_vllm_presets.py to generate InferenceServiceTemplate
Helm preset files from the preset list in a moreh-vllm container image.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@hhk7734 hhk7734 requested a review from a team as a code owner February 20, 2026 15:24
@hhk7734 hhk7734 requested review from TaeSooRhee-moreh, Copilot and nulledge and removed request for Copilot February 20, 2026 15:24
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings February 20, 2026 15:40
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds a new hack/ utility to generate InferenceServiceTemplate Helm preset templates for moreh-vllm based on the preset YAML filenames found inside a moreh-vllm container image.

Changes:

  • Introduces hack/gen_moreh_vllm_presets.py to parse preset filename stems and emit corresponding *.helm.yaml templates.
  • Encodes a small model registry (org/name/HF path + GPU-count overrides) to enrich generated metadata and compute resource requests.

@hhk7734 hhk7734 merged commit 635a7bc into main Feb 20, 2026
7 checks passed
@hhk7734 hhk7734 deleted the MAF-19056 branch February 20, 2026 15:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant