-
cmake 3.3.0 or higher
- Debian/Ubuntu:
sudo apt-get install cmake file cmake-curses-gui
- Arch Linux:
sudo pacman --sync cmake
- Debian/Ubuntu:
-
cupla
[email protected]:ComputationalRadiationPhysics/cupla.git
export CUPLA_ROOT=<cupla_SRC_CODE_DIR>
- example:
mkdir -p $HOME/src
git clone [email protected]:ComputationalRadiationPhysics/cupla.git $HOME/src/cupla
cd $HOME/src/cupla
git submodule init
git submodule update
export CUPLA_ROOT=$HOME/src/cupla
- create build directory
mkdir -p buildCuplaExample
cd buildCuplaExample
cmake $CUPLA_ROOT/example/CUDASamples/matrixMul -D<ACC_TYPE>=ON
- list of supported ACC_TYPES
ALPAKA_ACC_CPU_B_SEQ_T_OMP2_ENABLE
ALPAKA_ACC_CPU_B_SEQ_T_THREADS_ENABLE
ALPAKA_ACC_GPU_CUDA_ENABLE
ALPAKA_ACC_CPU_B_OMP2_T_SEQ_ENABLE
(only allowed in combination withCUPLA_KERNEL_OPTI
andCUPLA_KERNEL_ELEM
, because theblockSize
must be dim3(1,1,1)) see TuningGuide.md
- list of supported ACC_TYPES
make -j
./matrixMul -wA=320 -wB=320 -hA=320 -hB=320
(parameters must be a multiple of 32!)