Commit c7af423

fix(ci): add Pascal compute capability targets to CUDA wheel builds (abetlen#2237)
1 parent 26633bd commit c7af423

3 files changed

Lines changed: 5 additions & 2 deletions

.github/workflows/build-wheels-cuda.yaml

Lines changed: 2 additions & 2 deletions
@@ -169,10 +169,10 @@ jobs:
 }
 $cudaTagVersion = $nvccVersion.Replace('.','')
 $env:VERBOSE = '1'
-# Build real cubins for the supported GPUs, including sm_70, and keep
+# Build real cubins for the supported GPUs, including Pascal, and keep
 # one forward-compatible PTX target instead of embedding PTX for every
 # SM. This keeps the wheel under GitHub's 2 GiB release-asset limit.
-$env:CMAKE_ARGS = "-DGGML_CUDA_FORCE_MMQ=ON -DGGML_CUDA=on -DCMAKE_CUDA_ARCHITECTURES=70-real;75-real;80-real;86-real;89-real;90-real;90-virtual -DCMAKE_CUDA_FLAGS=-allow-unsupported-compiler -DCMAKE_CUDA_FLAGS_INIT=-allow-unsupported-compiler $env:CMAKE_ARGS"
+$env:CMAKE_ARGS = "-DGGML_CUDA_FORCE_MMQ=ON -DGGML_CUDA=on -DCMAKE_CUDA_ARCHITECTURES=60-real;61-real;70-real;75-real;80-real;86-real;89-real;90-real;90-virtual -DCMAKE_CUDA_FLAGS=-allow-unsupported-compiler -DCMAKE_CUDA_FLAGS_INIT=-allow-unsupported-compiler $env:CMAKE_ARGS"
 $env:CMAKE_ARGS = $env:CMAKE_ARGS + ' -DGGML_AVX2=off -DGGML_FMA=off -DGGML_F16C=off'
 python -m build --wheel
 # Publish tags that reflect the actual installed toolkit version.
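
To illustrate why the extra `60-real;61-real` entries matter, here is a minimal Python sketch that is not part of the commit. It assumes the usual CUDA compatibility rules: a cubin (`-real`) runs on devices with the same major compute capability and an equal or higher minor version, while PTX (`-virtual`) can be JIT-compiled for any device at or above its target. The helper names `parse_architectures` and `is_covered` are hypothetical, and the parser only handles plain numeric entries such as `61-real` (not `native`, `all`, or suffixed forms like `90a`).

```python
# Architecture lists copied from the removed and added CMAKE_ARGS lines.
OLD = "70-real;75-real;80-real;86-real;89-real;90-real;90-virtual"
NEW = "60-real;61-real;70-real;75-real;80-real;86-real;89-real;90-real;90-virtual"


def parse_architectures(spec):
    """Split a CMAKE_CUDA_ARCHITECTURES value into (real, virtual) lists.

    Each entry becomes a (major, minor) tuple. An entry without a suffix
    counts as both real and virtual, which is how CMake treats it.
    """
    real, virtual = [], []
    for entry in spec.split(";"):
        number, _, kind = entry.partition("-")
        cc = (int(number) // 10, int(number) % 10)
        if kind in ("", "real"):
            real.append(cc)
        if kind in ("", "virtual"):
            virtual.append(cc)
    return real, virtual


def is_covered(cc, real, virtual):
    """Return True if a device with compute capability `cc` can run the build.

    Assumption: cubins are compatible within a major version for equal or
    newer minors, and PTX is forward compatible through JIT compilation.
    """
    has_cubin = any(major == cc[0] and minor <= cc[1] for major, minor in real)
    has_ptx = any(target <= cc for target in virtual)
    return has_cubin or has_ptx


if __name__ == "__main__":
    pascal = (6, 1)  # e.g. a compute capability 6.1 device
    print(is_covered(pascal, *parse_architectures(OLD)))  # False
    print(is_covered(pascal, *parse_architectures(NEW)))  # True
```

Under these assumptions, the old list leaves Pascal devices with neither a matching cubin nor usable PTX, because the only PTX target (`90-virtual`) is newer than the device. The new list covers them with real cubins and still relies on the single PTX target for GPUs newer than 9.0.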

CHANGELOG.md

Lines changed: 2 additions & 0 deletions
@@ -7,6 +7,8 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 
 ## [Unreleased]
 
+- fix(ci): add Pascal compute capability targets to CUDA wheel builds by @abetlen in #2237
+
 ## [0.3.24]
 
 - docs: update ROCm install instructions by @agronholm in #1867

README.md

Lines changed: 1 addition & 0 deletions
@@ -126,6 +126,7 @@ CMAKE_ARGS="-DGGML_CUDA=on" pip install llama-cpp-python
 It is also possible to install a pre-built wheel with CUDA support. As long as your system meets some requirements:
 
 - CUDA Version is 12.1, 12.2, 12.3, 12.4 or 12.5
+- NVIDIA GPU compute capability is 6.0 or newer
 - Python Version is 3.10, 3.11 or 3.12
 
 ```bash
