Clear device-related annotations during pod release by archlitchi · Pull Request #5416 · volcano-sh/volcano

archlitchi · 2026-06-09T10:12:53Z

/kind bug

This applies to all shareable devices (NV, Ascend, etc.), so we need a unified fix instead of fixing them one by one.

gemini-code-assist

Code Review

This pull request implements the Release method for AscendDevices, NPUDevices, and GPUDevices to clean up speculative pod annotations during rollback or preemption, utilizing a new RemovePodAnnotations helper. Feedback focuses on preventing potential nil pointer dereferences by moving nil checks before calling SubResource in GPUDevices.Release, adding missing SubResource calls and receiver nil checks in AscendDevices.Release and NPUDevices.Release to avoid in-memory resource leaks, and adding defensive nil checks in RemovePodAnnotations.
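To illustrate the ordering this feedback asks for, here is a minimal Go sketch, not the PR's actual code. The `pod` and `deviceSet` types and the `release` method are hypothetical stand-ins for the scheduler's device structs. The receiver and the pod are checked for nil first, the in-memory usage is subtracted next, and the annotation keys are cleared last.

```go
package main

// pod is a hypothetical stand-in for the Kubernetes Pod object.
type pod struct {
	UID         string
	Annotations map[string]string
}

// deviceSet is a hypothetical stand-in for a per-node device struct.
// used maps a pod UID to the device units it holds in memory.
type deviceSet struct {
	used map[string]int
	free int
}

// release returns the pod's units to the free pool and clears the given
// annotation keys. It reports whether any cleanup was attempted.
func (d *deviceSet) release(p *pod, keys []string) bool {
	// Guard both the receiver and the pod before touching any field.
	if d == nil || p == nil {
		return false
	}
	// Subtract the in-memory usage so the units do not leak.
	if n, ok := d.used[p.UID]; ok {
		d.free += n
		delete(d.used, p.UID)
	}
	// Deleting from a nil map is a no-op in Go, so this is safe even
	// when the pod carries no annotations.
	for _, k := range keys {
		delete(p.Annotations, k)
	}
	return true
}
```

Because the nil checks come before any field access, calling `release` on a nil `*deviceSet` or with a nil pod returns early instead of panicking.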

Copilot

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

This PR ensures device-related pod annotations are cleaned up during Release() so speculative allocations don’t “stick” and affect subsequent scheduling, while preserving annotations for committed/running pods.

Changes:

Added a shared helper to remove Pod annotations via API server patch.
Updated multiple device plugins’ Release() implementations to clear specific device annotations (with “success” phase preserved where applicable).
Added regression tests for vGPU and HAMi to verify cleanup behavior and preservation for committed pods.
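One common way to delete annotations through the API server is a JSON merge patch in which each key to remove is set to `null`. The sketch below only builds such a patch body. `buildAnnotationRemovalPatch` is a hypothetical name, and whether the PR's `RemovePodAnnotations` uses this patch type is an assumption, not something stated in the review.

```go
package main

import (
	"encoding/json"
)

// buildAnnotationRemovalPatch (hypothetical helper) returns a JSON merge
// patch body that deletes the given annotation keys from an object's
// metadata. In a JSON merge patch, a null value means "remove this key".
func buildAnnotationRemovalPatch(keys []string) ([]byte, error) {
	annotations := make(map[string]interface{}, len(keys))
	for _, k := range keys {
		annotations[k] = nil // marshals to JSON null
	}
	patch := map[string]interface{}{
		"metadata": map[string]interface{}{
			"annotations": annotations,
		},
	}
	// encoding/json sorts map keys, so the output is deterministic.
	return json.Marshal(patch)
}
```

With client-go, a body like this would typically be sent with `types.MergePatchType` through the Pods `Patch` call. Keys that are already absent from the pod are simply ignored by the server.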

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 5 comments.

| File | Description |
| --- | --- |
| pkg/scheduler/api/devices/util.go | Adds `RemovePodAnnotations` helper that patches the Pod to delete annotation keys. |
| pkg/scheduler/api/devices/nvidia/vgpu/device_info.go | Cleans vGPU annotations in `Release()` for speculative allocations. |
| pkg/scheduler/api/devices/nvidia/vgpu/device_info_test.go | Adds regression tests verifying speculative cleanup and committed preservation. |
| pkg/scheduler/api/devices/ascend/mindcluster/ascend310p/vnpu/device_info.go | Cleans NPU-related annotations in `Release()`. |
| pkg/scheduler/api/devices/ascend/hami/device_info.go | Cleans HAMi/Ascend annotations in `Release()` (including dynamic device keys). |
| pkg/scheduler/api/devices/ascend/hami/device_info_test.go | Adds tests for speculative cleanup and committed preservation; initializes device key maps for the tests. |
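
The speculative-versus-committed distinction that these tests exercise could look roughly like the following. This is a sketch under assumptions: the `annotationsToClear` function, the `phaseKey` parameter, and the example annotation keys are hypothetical, and only the "success" phase value comes from the overview above.

```go
package main

// annotationsToClear (hypothetical) returns the device annotation keys
// that should be removed from a pod on release. A pod whose phase
// annotation equals "success" is treated as committed, so nothing is
// cleared for it. Otherwise only keys actually present are returned.
func annotationsToClear(annotations map[string]string, deviceKeys []string, phaseKey string) []string {
	// Assumption: a "success" phase marks a committed allocation.
	if annotations[phaseKey] == "success" {
		return nil
	}
	var toClear []string
	for _, k := range deviceKeys {
		if _, ok := annotations[k]; ok {
			toClear = append(toClear, k)
		}
	}
	return toClear
}
```

Returning only the keys that are present keeps the resulting patch small and makes a release on an already-clean pod a no-op.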

Signed-off-by: limengxuan <mengxuan.li@dynamia.ai>

archlitchi · 2026-06-10T10:27:31Z

CC @JesseStutler

JesseStutler · 2026-06-11T06:07:49Z

/approve
/lgtm
Do we need to backport? @archlitchi

volcano-sh-bot · 2026-06-11T06:07:57Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: JesseStutler, nishitnshah

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~pkg/scheduler/OWNERS~~ [JesseStutler]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

JesseStutler · 2026-06-11T07:00:41Z

/cherrypick release-1.15
/cherrypick release-1.14
/cherrypick release-1.13

volcano-sh-bot · 2026-06-11T07:01:24Z

@JesseStutler: new pull request created: #5429

In response to this:

/cherrypick release-1.15
/cherrypick release-1.14
/cherrypick release-1.13

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Copilot AI review requested due to automatic review settings June 9, 2026 10:12

volcano-sh-bot added the kind/bug label (categorizes issue or PR as related to a bug) Jun 9, 2026

volcano-sh-bot requested review from JesseStutler and william-wang June 9, 2026 10:12

volcano-sh-bot added the size/L label (denotes a PR that changes 100-499 lines, ignoring generated files) Jun 9, 2026

gemini-code-assist Bot reviewed Jun 9, 2026

Comment thread pkg/scheduler/api/devices/nvidia/vgpu/device_info.go

Comment thread pkg/scheduler/api/devices/ascend/hami/device_info.go

Comment thread pkg/scheduler/api/devices/ascend/mindcluster/ascend310p/vnpu/device_info.go

Comment thread pkg/scheduler/api/devices/util.go

Copilot AI reviewed Jun 9, 2026

volcano-sh-bot added the do-not-merge/contains-merge-commits label Jun 9, 2026

archlitchi force-pushed the master branch from 47d9383 to 7658f74 Compare June 9, 2026 10:51

volcano-sh-bot added the do-not-merge/invalid-commit-message label (indicates that a PR should not merge because it has an invalid commit message) and removed the do-not-merge/contains-merge-commits label Jun 9, 2026

archlitchi force-pushed the master branch from 7658f74 to c1c779a Compare June 9, 2026 10:56

volcano-sh-bot removed the do-not-merge/invalid-commit-message label (indicates that a PR should not merge because it has an invalid commit message) Jun 9, 2026

archlitchi force-pushed the master branch from c1c779a to c62e2d8 Compare June 9, 2026 11:00

nishitnshah suggested changes Jun 10, 2026

Comment thread pkg/scheduler/api/devices/nvidia/vgpu/device_info.go

volcano-sh-bot added the do-not-merge/contains-merge-commits label Jun 10, 2026

archlitchi force-pushed the master branch from 483e251 to 5fcc309 Compare June 10, 2026 10:05

volcano-sh-bot added do-not-merge/contains-merge-commits and removed do-not-merge/contains-merge-commits labels Jun 10, 2026

Clean up annotations during release

03a6238

Signed-off-by: limengxuan <mengxuan.li@dynamia.ai>

archlitchi force-pushed the master branch from 9270a62 to 03a6238 Compare June 10, 2026 10:08

volcano-sh-bot removed the do-not-merge/contains-merge-commits label Jun 10, 2026

nishitnshah approved these changes Jun 10, 2026

DSFans2014 mentioned this pull request Jun 11, 2026

fix: release ascend device resources when scheduling fails #5378

Closed

volcano-sh-bot assigned JesseStutler Jun 11, 2026

volcano-sh-bot added the lgtm label (indicates that a PR is ready to be merged) Jun 11, 2026

volcano-sh-bot added the approved label (indicates a PR has been approved by an approver from all required OWNERS files) Jun 11, 2026

volcano-sh-bot merged commit ba800be into volcano-sh:master Jun 11, 2026
32 checks passed

volcano-sh-bot mentioned this pull request Jun 11, 2026

[release-1.15] Clear device-related annotations during pod release #5429

Open
