This project is a benchmark for CUDA's memcpy using the CUDA runtime API.
It supports the following memcpy kinds:
- host to host
- host to device
- device to host
- device to device
For more a more sophisticated benchmark see NVIDIA/nvbandwidth.
nvcc -O3 src/main.c && ./a.outLicensed under either of
- Apache License, Version 2.0 (LICENSES/Apache-2.0.txt or http://www.apache.org/licenses/LICENSE-2.0)
- MIT license (LICENSES/MIT.txt or http://opensource.org/licenses/MIT)
at your option.
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.