e2e Test - full-sweep --multi-node --evals-only --config-files .github/configs/amd-master.yaml #1117
e2e-tests.yml
on: workflow_dispatch
get-jobs
9s
Matrix: single-node /
Waiting for pending jobs
Matrix: multi-node eval /
Matrix: multi-node /
Waiting for pending jobs
calc-success-rate
17s
Annotations
1 error and 1 warning
|
multi-node eval / (rocm/sgl-dev:sglang-0.5.9-rocm720-mi35x-mori-0227-3, amd/DeepSeek-R1-0528-MXFP... / dsr1_8k1k fp4 mi355x-disagg sglang-disagg | P(tp4/ep4/dptrue/nw4) D(tp8/ep8/dptrue/nw1) | disagg-true spec-mtp conc-1024x2048x4096 | eval-only
File was unable to be removed Error: EACCES: permission denied, rmdir '/it-share/gharunners/gharunner8/actions-runner/_work/InferenceX/InferenceX/benchmark_logs/logs/slurm_job-5656'
|
|
multi-node eval / (rocm/sgl-dev:sglang-0.5.9-rocm720-mi35x-mori-0227-3, amd/DeepSeek-R1-0528-MXFP... / dsr1_8k1k fp4 mi355x-disagg sglang-disagg | P(tp4/ep4/dptrue/nw4) D(tp8/ep8/dptrue/nw1) | disagg-true spec-mtp conc-1024x2048x4096 | eval-only
Unable to clean or reset the repository. The repository will be recreated instead.
|
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
eval_results_all
|
152 Bytes |
sha256:7bee59da6675400d058057fb7d3b8bedaf0f91ee07de05cf9337c873b5dd8939
|
|
|
run-stats
|
235 Bytes |
sha256:d6a0ffdebac5d7d16e655553edd226364b4c6afbb7e33c132b70127d3ef55a58
|
|