You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Fix training script portability and documentation issues
- sbatch_train.sh: resolve REPO_ROOT from script location, not $(pwd)
- sbatch_train.sh: consolidate PREDECODER_DISABLE_SDR/TORCH_COMPILE defaults
so Docker and bare-metal paths behave identically
- sbatch_train.sh: log message before chmod 1777; add --nodes=1 to multi-GPU examples
- cluster_install_deps.sh: arch-aware Miniconda URL (supports aarch64/ARM)
- cluster_install_deps.sh: single TORCH_CUDA default (remove redundant fallback)
- TRAINING.md: document SHARED_LOG_DIR; correct cluster defaults for SDR/compile vars
- conf/config_qec_decoder_r{9,13}_fp8.yaml: note that training hyperparams come
from internal defaults, point to config_public.yaml
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
|`PREDECODER_TIMING_RUN`| unset | Set `1` for timing/benchmarking mode (disables some overhead). |
148
-
|`PREDECODER_TORCH_COMPILE`| unset |`0` to disable `torch.compile`, `1` to enable. |
149
-
|`PREDECODER_DISABLE_SDR`| unset |`1` to skip Syndrome Density Reduction computation (saves time). |
148
+
|`PREDECODER_TORCH_COMPILE`|`0` when run via `sbatch_train.sh`, otherwise unset |`0` to disable `torch.compile`, `1` to enable. |
149
+
|`PREDECODER_DISABLE_SDR`|`1` when run via `sbatch_train.sh`, otherwise unset |`1` to skip Syndrome Density Reduction computation (saves time on cluster). |
150
150
|`TORCH_COMPILE`| unset | Alternative way to control `torch.compile` (`0`/`1`). |
151
151
|`TORCH_COMPILE_MODE`| unset |`default`, `reduce-overhead`, or `max-autotune`. |
0 commit comments