Skip to content

Fixes#57

Merged
psychocoderHPC merged 1 commit intodevfrom
fixes
Mar 6, 2026
Merged

Fixes#57
psychocoderHPC merged 1 commit intodevfrom
fixes

Conversation

@ikbuibui
Copy link
Copy Markdown
Contributor

@ikbuibui ikbuibui commented Mar 4, 2026

Updates get temperature for newer NVML API versions. Closes #54
Fix bug which causes the temperature measurement thread to exit too early. Closes #56
Add guard in case num GPUs exceeds MAX_GPU_NUM. Closes #55

These fixes are based on patches provided by @alyoshin

Co-authored-by: alyoshin <alyoshin.s@gmail.com>
@psychocoderHPC psychocoderHPC merged commit e94e1ee into dev Mar 6, 2026
2 checks passed
@psychocoderHPC psychocoderHPC deleted the fixes branch March 6, 2026 16:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Fix early exit from temperature monitoring thread Check MAX_GPU_NUM vs number of GPUs Use nvmlDeviceGetTemperatureV()

2 participants