CuMesh_ROCm: High-Performance Geometry Processing for PyTorch (ROCm/HIP)

CuMesh_ROCm is a ROCm/HIP port of JeffreyXiang/CuMesh, a GPU-accelerated library for high-performance 3D geometry processing directly within the PyTorch ecosystem.

The original CuMesh was CUDA-only. This fork converts all CUDA APIs to ROCm/HIP, enabling execution on AMD GPUs.

Key Features

GPU-Accelerated Mesh Operations: Topology queries, simplification, hole filling, cleaning — on AMD GPUs via ROCm
Remeshing: Narrow-band UDF + Dual Contouring
UV Unwrapping: GPU chart clustering + xatlas packing
cuBVH: Ray tracing and signed/unsigned distance queries (converted from cubvh)

Supported GPUs

GPU	Status	Notes
NVIDIA (CUDA)	✅	Use original CuMesh
AMD RDNA3 (gfx11xx)	✅	Tested, stable
AMD RDNA4 (gfx1201)	⚠️	Works at <500K elements. Large meshes crash due to rocPRIM bug #776

Installation

Prerequisites

Python >= 3.10
PyTorch >= 2.4 (with ROCm support)
ROCm >= 7.2

Build from Source

git clone https://github.com/ptj0225/CuMesh_ROCm.git --recursive
cd CuMesh_ROCm
pip install -e . --no-build-isolation

For specific GPU arch:

export GPU_ARCHS="gfx1201"  # default: native
pip install -e . --no-build-isolation

Branches

Branch	Description
`main`	HIP conversion via hipcub (CUB compatibility layer)
`rocprim-direct`	HIP conversion via rocPRIM direct calls (no hipcub dependency)

Conversion Details

The conversion from CUDA to ROCm/HIP was done using:

hipify-perl: Automated CUDA → HIP API translation (~95% automated)
Manual fixes: cuda::std → rocprim::tuple, half-precision flags, namespace corrections
cubvh: De-submoduled and fully converted to HIP (including thrust::cuda::par → thrust::hip::par)

Known Issues

gfx1201 (RDNA4) crash at >500K elements: Due to wavefront=32 not being fully supported in rocPRIM. See ROCm/rocPRIM#776
Dual contour memory fault: Hashmap miss can cause out-of-bounds access. Fixed with bounds check in simple_dual_contour.cu

API Reference

Same as upstream CuMesh. See examples directory.

License

MIT License

Acknowledgements

JeffreyXiang/CuMesh — original CUDA implementation
cubvh — CUDA BVH toolkit
xatlas — UV parameterization library
pamo — GPU parallel edge collapse reference

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
cumesh		cumesh
examples		examples
src		src
third_party		third_party
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CuMesh_ROCm: High-Performance Geometry Processing for PyTorch (ROCm/HIP)

Key Features

Supported GPUs

Installation

Prerequisites

Build from Source

Branches

Conversion Details

Known Issues

API Reference

License

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CuMesh_ROCm: High-Performance Geometry Processing for PyTorch (ROCm/HIP)

Key Features

Supported GPUs

Installation

Prerequisites

Build from Source

Branches

Conversion Details

Known Issues

API Reference

License

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages