Add cuStabilizer BitMatrixSampler integration to DEM sampling by kvmto · Pull Request #24 · NVIDIA/Ising-Decoding

kvmto · 2026-03-24T16:42:10Z

Summary

Replaces the pure-torch dem_sampling() with a version that transparently uses cuQuantum's BitMatrixSampler when available, falling back to the original torch path when cuST is not installed or USE_CUSTAB=0
Adds custab_matrix_sampling() with sampler caching, max_shots tracking, and CuPy zero-copy DLPack GPU pipeline (torch → CuPy → cuStabilizer → CuPy → torch)
Adds timing instrumentation (get_dem_sampling_avg_ms) for training logs, input validation on H/p shapes, and USE_CUSTAB env var toggle

Files changed

code/qec/dem_sampling.py — core implementation (modified)
code/tests/test_dem_sampling_custab.py — cuST-specific + torch fallback unit tests (new)
code/tests/test_dem_sampling_integration.py — end-to-end pipeline test via MemoryCircuitTorch (new)

Test plan

Existing test_dem_sampling.py passes (API contract preserved)
New test_dem_sampling_custab.py passes (cuST path + torch fallback with deterministic p)
New test_dem_sampling_integration.py passes (full precompute_dem → generate_batch pipeline)
Smoke training run confirms cuST path activates: Using cuST BitMatrixSampler path (max_shots=1024, gpu_native=True)
DLPack zero-copy round-trip verified between torch and CuPy on GPU

Replace the pure-torch dem_sampling with a version that transparently uses cuQuantum's BitMatrixSampler when available, falling back to the original torch path when cuST is not installed or USE_CUSTAB=0. - custab_matrix_sampling() with sampler caching and max_shots tracking - CuPy zero-copy DLPack GPU pipeline (torch -> cupy -> cuST -> torch) - Timing instrumentation (get_dem_sampling_avg_ms) for training logs - Input validation on H/p shapes - USE_CUSTAB env var toggle with reset helpers for testing - Vectorized measure_from_stacked_frames (kept from main) - New tests: test_dem_sampling_custab.py, test_dem_sampling_integration.py Signed-off-by: kvmto <kmato@nvidia.com>

requirements_public_inference.txt: - Document cupy-cudaXXX as an optional GPU-only prerequisite alongside the existing tensorrt comment; explains the DLPack fallback behaviour. tests/test_dem_sampling_custab.py: - Add TestDEMSamplingCupyGPUPath (skipped unless custab + CuPy + CUDA are all present) covering: - _CUPY_AVAILABLE flag is set - correct shape and uint8 dtype from the GPU-native path - deterministic syndrome matches expected checks - GPU/CuPy result matches torch CPU fallback on deterministic input NOTICE: - Add CuPy (MIT, Preferred Networks) entry - Add TensorRT (Apache 2.0, NVIDIA) entry — was missing - Add onnxscript (MIT, Microsoft) entry — was missing - Add OmegaConf (BSD-3-Clause, Omry Yadan) entry — was missing - Include full license text or reference for all new entries Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Signed-off-by: Ivan Basov <ibasov@nvidia.com>

* Add cuStabilizer BitMatrixSampler integration to DEM sampling Replace the pure-torch dem_sampling with a version that transparently uses cuQuantum's BitMatrixSampler when available, falling back to the original torch path when cuST is not installed or USE_CUSTAB=0. - custab_matrix_sampling() with sampler caching and max_shots tracking - CuPy zero-copy DLPack GPU pipeline (torch -> cupy -> cuST -> torch) - Timing instrumentation (get_dem_sampling_avg_ms) for training logs - Input validation on H/p shapes - USE_CUSTAB env var toggle with reset helpers for testing - Vectorized measure_from_stacked_frames (kept from main) - New tests: test_dem_sampling_custab.py, test_dem_sampling_integration.py Signed-off-by: kvmto <kmato@nvidia.com> * feat: add CuPy dependency, tests, and NOTICE entry requirements_public_inference.txt: - Document cupy-cudaXXX as an optional GPU-only prerequisite alongside the existing tensorrt comment; explains the DLPack fallback behaviour. tests/test_dem_sampling_custab.py: - Add TestDEMSamplingCupyGPUPath (skipped unless custab + CuPy + CUDA are all present) covering: - _CUPY_AVAILABLE flag is set - correct shape and uint8 dtype from the GPU-native path - deterministic syndrome matches expected checks - GPU/CuPy result matches torch CPU fallback on deterministic input NOTICE: - Add CuPy (MIT, Preferred Networks) entry - Add TensorRT (Apache 2.0, NVIDIA) entry — was missing - Add onnxscript (MIT, Microsoft) entry — was missing - Add OmegaConf (BSD-3-Clause, Omry Yadan) entry — was missing - Include full license text or reference for all new entries Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Signed-off-by: Ivan Basov <ibasov@nvidia.com> --------- Signed-off-by: kvmto <kmato@nvidia.com> Signed-off-by: Ivan Basov <ibasov@nvidia.com> Co-authored-by: Ivan Basov <ibasov@nvidia.com> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>

kvmto force-pushed the custab-dem-sampling branch from 888abbd to 777ff4e Compare March 24, 2026 16:45

ivanbasov reviewed Mar 24, 2026

View reviewed changes

Comment thread code/qec/dem_sampling.py

ivanbasov force-pushed the custab-dem-sampling branch from c2bf416 to 519febb Compare March 24, 2026 22:24

ivanbasov force-pushed the custab-dem-sampling branch 2 times, most recently from 791cd68 to 70e60f9 Compare March 24, 2026 22:29

ivanbasov force-pushed the custab-dem-sampling branch from 70e60f9 to 85214f3 Compare March 24, 2026 22:32

ivanbasov self-requested a review March 24, 2026 22:33

ivanbasov approved these changes Mar 24, 2026

View reviewed changes

ivanbasov merged commit a250ca1 into NVIDIA:main Mar 24, 2026
12 checks passed

ivanbasov deleted the custab-dem-sampling branch March 24, 2026 22:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add cuStabilizer BitMatrixSampler integration to DEM sampling#24

Add cuStabilizer BitMatrixSampler integration to DEM sampling#24
ivanbasov merged 2 commits into
NVIDIA:mainfrom
kvmto:custab-dem-sampling

kvmto commented Mar 24, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kvmto commented Mar 24, 2026

Summary

Files changed

Test plan

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants