test(e2e): add GPU workload image artifacts #1484
E2E Test Attestation
Local E2E tests passed. CI does not currently run this GPU host validation, so this comment records the local run.
Test Summary
Tests Executed
Direct Image Validation
At the time this was posted, I was considering moving the image definitions out of this repo to reduce churn. That is no longer the active path for this PR. Review and merge readiness should be evaluated against the changes in this PR directly.
7c6557a to 5cc2d92
5cc2d92 to efe4d25
PR Review Status
Validation: this is maintainer-authored, project-valid GPU E2E infrastructure work for #1476, adding local GPU workload image artifacts and build task wiring.
Review findings:
Docs: this is E2E infrastructure documentation, not a direct Fern UX/API change.
Next state:
Closes #1476
Define local GPU workload image sources for smoke-pass, smoke-fail, and cuda-basic validation, plus a mise build task that tags images with the source revision and records the latest local image refs.
Signed-off-by: Evan Lezar <elezar@nvidia.com>
Signed-off-by: Evan Lezar <elezar@nvidia.com>
efe4d25 to de40d64
Signed-off-by: Evan Lezar <elezar@nvidia.com>
Signed-off-by: Evan Lezar <elezar@nvidia.com>
Re-check After Author Update
I re-evaluated latest head
Disposition: resolved.
Remaining items:
Next state:
de40d64 to 8426fac
Updated PR #1484 on top of current Gator follow-up in
Also rewrote the branch to add the missing DCO sign-off on the existing README-lint commit.
Validation on the updated branch:
Maintainer Approval Needed
Gator validation and PR monitoring are complete for latest head
Validation: maintainer-authored GPU E2E infrastructure work for #1476, adding local GPU workload image artifacts and build task wiring.
Human maintainer approval or merge decision is now required.
Summary
Define local GPU workload image artifacts for smoke-pass, smoke-fail, and cuda-basic validation. The build task supports Docker or Podman through the existing container-engine helper, tags images with the OpenShell source revision plus an external-input fingerprint, and writes latest local refs plus a workload manifest for downstream tooling.
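As a rough illustration of the tagging and metadata steps described above, the sketch below derives a tag from a source revision plus a hash of external inputs, then writes a latest.env and a workloads.yaml. It is not taken from tasks/scripts/e2e-gpu-build-images.sh: the function names, the `<revision>-<fingerprint>` tag layout, the `local/` image prefix, the env key names, the YAML fields, and the example inputs are all assumptions made for the example.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hash the external build inputs into a short fingerprint. One input per
# line, so changing any single input changes the hash.
fingerprint_inputs() {
  printf '%s\n' "$@" | sha256sum | cut -c1-12
}

# Hypothetical tag layout: <source revision>-<input fingerprint>.
image_tag() {
  local revision="$1"
  shift
  printf '%s-%s\n' "${revision}" "$(fingerprint_inputs "$@")"
}

# Write hypothetical latest.env and workloads.yaml files for the given tag.
# The three workload names come from the PR; everything else is assumed.
write_metadata() {
  local out_dir="$1" tag="$2" name ref key
  : > "${out_dir}/latest.env"
  printf 'workloads:\n' > "${out_dir}/workloads.yaml"
  for name in smoke-pass smoke-fail cuda-basic; do
    ref="local/${name}:${tag}"
    # Env key: upper-case the workload name and replace '-' with '_'.
    key="${name^^}"
    key="${key//-/_}_IMAGE"
    printf '%s=%s\n' "${key}" "${ref}" >> "${out_dir}/latest.env"
    printf '  - name: %s\n    image: %s\n' "${name}" "${ref}" >> "${out_dir}/workloads.yaml"
  done
}

# Made-up revision and inputs, written to a throwaway directory.
out_dir="$(mktemp -d)"
tag="$(image_tag "abc1234" "cuda-samples=v12.8" "base-image=example/base:latest")"
write_metadata "${out_dir}" "${tag}"
cat "${out_dir}/latest.env" "${out_dir}/workloads.yaml"
```

Including the input fingerprint in the tag means a change to an external input produces a new tag even when the source revision is unchanged, which is presumably why the build task records both.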
Related Issue
Closes #1476
Changes
- e2e/gpu/images/smoke-pass: adds a positive marker-only workload image.
- e2e/gpu/images/smoke-fail: adds a stable negative-path workload image.
- e2e/gpu/images/cuda-basic: builds CUDA samples deviceQuery and vectorAdd from NVIDIA/cuda-samples v12.8, copies the binaries into the OpenShell community base image, and runs both validations.
- tasks/scripts/e2e-gpu-build-images.sh: builds only the supported workload image set, supports subset selection, records source and external build inputs, labels images, writes latest.env, and emits workloads.yaml.
- tasks/test.toml: adds mise run e2e:workloads:build.
- e2e/gpu/README.md: documents the workload image contract, build task, direct validation flow, current mise run e2e:gpu behavior, and the fact that the generated manifest is not consumed by the current Rust GPU target yet.
- .gitignore: ignores generated GPU workload build metadata.
Review Follow-up
Testing
- bash -n tasks/scripts/e2e-gpu-build-images.sh
- git diff --check
- mise run pre-commit
- mise run e2e:gpu validation is recorded in the PR comments
Tests added:
Checklist
Documentation updated:
- e2e/gpu/README.md: workload contract, build task, generated manifest, current GPU e2e target, and direct validation.
- e2e/gpu/images/*/README.md: per-image purpose, build, and direct-run instructions.
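A direct validation pass over the three workload images could check each container's exit status against what that workload is meant to demonstrate. The sketch below assumes, without confirmation from the PR text, that smoke-fail is expected to exit non-zero while smoke-pass and cuda-basic are expected to exit zero. The function names are hypothetical, and the exit statuses are hard-coded so the example runs without a GPU host or a container engine.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Assumed contract (not confirmed by the PR text): smoke-fail must exit
# non-zero, every other workload must exit zero.
expected_outcome() {
  case "$1" in
    smoke-fail) echo "nonzero" ;;
    *)          echo "zero" ;;
  esac
}

# Compare an observed exit status with the assumed expectation.
check_outcome() {
  local name="$1" status="$2"
  if [ "$(expected_outcome "${name}")" = "zero" ]; then
    [ "${status}" -eq 0 ]
  else
    [ "${status}" -ne 0 ]
  fi
}

# Observed statuses would come from running each image with the chosen
# container engine; they are hard-coded here so the sketch runs anywhere.
for pair in smoke-pass:0 smoke-fail:1 cuda-basic:0; do
  name="${pair%%:*}"
  status="${pair##*:}"
  if check_outcome "${name}" "${status}"; then
    echo "${name}: ok"
  else
    echo "${name}: unexpected exit status ${status}"
  fi
done
```

Treating the negative-path image as a success only when it fails keeps a single pass/fail report for all three workloads.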