Skip to content

e2e Test - full-sweep --multi-node --evals-only --config-files .github/configs/amd-master.yaml #1117

e2e Test - full-sweep --multi-node --evals-only --config-files .github/configs/amd-master.yaml

e2e Test - full-sweep --multi-node --evals-only --config-files .github/configs/amd-master.yaml #1117

Re-run triggered March 17, 2026 00:19
Status Failure
Total duration 1m 12s
Artifacts 5

e2e-tests.yml

on: workflow_dispatch
get-jobs
9s
get-jobs
Matrix: single-node /
Waiting for pending jobs
Matrix: multi-node eval /
Matrix: multi-node /
Waiting for pending jobs
collect-evals  /  collect-evals
15s
collect-evals / collect-evals
collect-results  /  collect-results
15s
collect-results / collect-results
calc-success-rate
17s
calc-success-rate
Fit to window
Zoom out
Zoom in

Annotations

1 error and 1 warning
multi-node eval / (rocm/sgl-dev:sglang-0.5.9-rocm720-mi35x-mori-0227-3, amd/DeepSeek-R1-0528-MXFP... / dsr1_8k1k fp4 mi355x-disagg sglang-disagg | P(tp4/ep4/dptrue/nw4) D(tp8/ep8/dptrue/nw1) | disagg-true spec-mtp conc-1024x2048x4096 | eval-only
File was unable to be removed Error: EACCES: permission denied, rmdir '/it-share/gharunners/gharunner8/actions-runner/_work/InferenceX/InferenceX/benchmark_logs/logs/slurm_job-5656'

Artifacts

Produced during runtime
Name Size Digest
eval_results_all
152 Bytes
sha256:7bee59da6675400d058057fb7d3b8bedaf0f91ee07de05cf9337c873b5dd8939
run-stats
235 Bytes
sha256:d6a0ffdebac5d7d16e655553edd226364b4c6afbb7e33c132b70127d3ef55a58