A minimal scientific-computing library for solving large sparse linear systems Ax = b using iterative Krylov methods with:
- CPU (single-thread) baseline
- CPU (OpenMP) acceleration
- GPU (CUDA) vector ops + scaffolding
This repo is designed to teach you the math and the performance fundamentals as you build.
Build and test the CPU baseline, then run the solver:

```sh
cmake -S . -B build -DCMAKE_BUILD_TYPE=Release
cmake --build build -j
ctest --test-dir build
./build/bin/solve --nx 128 --ny 128 --tol 1e-8 --max-iters 5000
```

Build with OpenMP acceleration:

```sh
cmake -S . -B build -DCMAKE_BUILD_TYPE=Release -DENABLE_OMP=ON
cmake --build build -j
```

Build with CUDA and run the GPU sanity check:

```sh
cmake -S . -B build -DCMAKE_BUILD_TYPE=Release -DENABLE_CUDA=ON
cmake --build build -j
./build/bin/cuda_sanity --n 1000000
```

What's included:
- CSR sparse matrix
- 2D Poisson/Laplacian matrix generator (SPD)
- SpMV (CPU; OpenMP-enabled if built with ENABLE_OMP)
- BLAS-1 ops: dot, axpy, scal, norm2 (CPU; OpenMP-enabled)
- CG solver (CPU)
- Jacobi preconditioner
- Minimal tests + benchmark-style CLI app
We solve Ax = b where A is large and sparse.
Conjugate Gradient (CG) applies when A is symmetric positive definite (SPD).
It builds a sequence of iterates x_k that, in exact arithmetic, converges in at most n steps; in practice it converges much sooner, at a rate governed by the condition number of A.
- Residual: r_k = b - A x_k
- Stopping criterion: ||r_k|| / ||b|| < tol
Given x_0, set r_0 = b - A x_0 and p_0 = r_0. Then for k = 0, 1, 2, ...:

    alpha_k = (r_k^T r_k) / (p_k^T A p_k)
    x_{k+1} = x_k + alpha_k p_k
    r_{k+1} = r_k - alpha_k A p_k
    beta_k  = (r_{k+1}^T r_{k+1}) / (r_k^T r_k)
    p_{k+1} = r_{k+1} + beta_k p_k

Each iteration needs exactly one matrix-vector product (A p_k); everything else is BLAS-1 work.
Preconditioning applies a cheap approximate inverse M^{-1} to the residual each iteration (z = M^{-1} r) to reduce the effective condition number and hence the iteration count.
The Jacobi preconditioner uses M = diag(A), so applying M^{-1} is just elementwise scaling by 1/a_ii.