eth-cscs/gpu-benchmarks

GPU benchmarks

A collection of GPU benchmarks for evaluating software stack performance. The tests are written in CUDA. A simple HIP compatibility layer lets them run unmodified on AMD GPUs, while NVIDIA systems do not need HIP as a dependency.
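As a rough illustration of how such a compatibility layer can work, the sketch below aliases a few CUDA runtime names to their HIP equivalents when building for AMD and leaves them untouched under nvcc. This is not the repository's actual header: the set of mapped calls, the `round_trip` helper, and the host-only fallback (added here only so the sketch compiles without a GPU toolchain) are all assumptions made for illustration.

```cpp
// Hypothetical compatibility shim, not the repository's actual header.
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <cstring>

#if defined(__HIP_PLATFORM_AMD__)
// AMD build: map the CUDA spellings used by the tests onto HIP.
#include <hip/hip_runtime.h>
#define cudaError_t            hipError_t
#define cudaSuccess            hipSuccess
#define cudaMalloc             hipMalloc
#define cudaFree               hipFree
#define cudaMemcpy             hipMemcpy
#define cudaMemcpyHostToDevice hipMemcpyHostToDevice
#define cudaMemcpyDeviceToHost hipMemcpyDeviceToHost
#define cudaDeviceSynchronize  hipDeviceSynchronize
#elif defined(__CUDACC__)
// NVIDIA build: plain CUDA, no HIP headers required.
#include <cuda_runtime.h>
#else
// ASSUMPTION: host-only stand-ins so this sketch builds with an ordinary
// C++ compiler. "Device" memory here is just heap memory.
using cudaError_t = int;
constexpr cudaError_t cudaSuccess = 0;
enum cudaMemcpyKind { cudaMemcpyHostToDevice, cudaMemcpyDeviceToHost };
inline cudaError_t cudaMalloc(void** p, std::size_t n) {
    *p = std::malloc(n);
    return *p ? 0 : 1;
}
inline cudaError_t cudaFree(void* p) { std::free(p); return 0; }
inline cudaError_t cudaMemcpy(void* dst, const void* src, std::size_t n,
                              cudaMemcpyKind) {
    std::memcpy(dst, src, n);
    return 0;
}
inline cudaError_t cudaDeviceSynchronize() { return 0; }
#endif

// Test-style code is written once against the CUDA names.
// Copies `count` floats to the device and back; returns true on success.
inline bool round_trip(const float* in, float* out, std::size_t count) {
    assert(in != nullptr && out != nullptr);
    void* dev = nullptr;
    const std::size_t bytes = count * sizeof(float);
    if (cudaMalloc(&dev, bytes) != cudaSuccess) return false;
    const bool ok =
        cudaMemcpy(dev, in, bytes, cudaMemcpyHostToDevice) == cudaSuccess &&
        cudaMemcpy(out, dev, bytes, cudaMemcpyDeviceToHost) == cudaSuccess &&
        cudaDeviceSynchronize() == cudaSuccess;
    cudaFree(dev);
    return ok;
}
```

With this shape, only the shim knows which vendor is in use. The benchmark sources keep their CUDA spelling on both platforms, which matches the stated goal of not requiring HIP on NVIDIA systems.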

ROCm core API

  • Memory allocations
  • Page faults
  • Launch latencies
  • Memory access latencies
  • Memory bandwidth

Parallel algorithms

Supports both ROCm's rocPRIM and NVIDIA's CUB/Thrust.

  • Radix sort
  • Prefix sums
  • Reductions

rocSOLVER

rocFFT

About

Testing the performance of AMD's software stack, from basic parallel primitives to ML frameworks
