Commit 82decf1
committed
fix(examples): vllm example uses python3 + moves port into yaml
Found during dev B200 verification (Qwen2.5-0.5B variant of this shape):
- vllm/vllm-openai image has python3 only, no `python` symlink — the
K8s container fails to start with `exec: "python": executable file
not found in $PATH` because the platform's `command` field overrides
the image's ENTRYPOINT entirely.
- The platform helm path passes numeric arg tokens to the Rollout spec
as integers, and the K8s API server rejects them (`args[3] ... must
be of type string: "integer"`). Moving `port: 8000` into the YAML
config keeps every CLI token a non-numeric string while still letting
vLLM pick up the port — and matches the "all config in one file"
intent of --config.
Verified end-to-end on dev (cluster c-01-c-11-centml-org, hw x1-large-b200):
created deployment via SDK → rollout HEALTHY at t+76s → POST
/v1/chat/completions returned HTTP 200 with a real Qwen completion.
Signed-off-by: Honglin Cao <hocao@nvidia.com>1 parent 2095ab5 commit 82decf1
2 files changed
Lines changed: 2 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
30 | | - | |
| 30 | + | |
31 | 31 | | |
32 | 32 | | |
33 | 33 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
1 | 2 | | |
2 | 3 | | |
3 | 4 | | |
| |||
0 commit comments