This code demonstrates a usage of cuBLAS RotEx
function to apply the Givens rotation matrix to vector x and y
A = | 1.0 | 2.0 | 3.0 | 4.0 |
B = | 5.0 | 6.0 | 7.0 | 8.0 |
This function is an extension to the routine cublas<t>rot
where input data, output data, cosine/sine type, and compute type can be specified independently.
See documentation for further details.
All GPUs supported by CUDA Toolkit (https://developer.nvidia.com/cuda-gpus)
Linux
Windows
x86_64
ppc64le
arm64-sbsa
- A Linux/Windows system with recent NVIDIA drivers.
- CMake version 3.18 minimum
$ mkdir build
$ cd build
$ cmake ..
$ make
Make sure that CMake finds expected CUDA Toolkit. If that is not the case you can add argument -DCMAKE_CUDA_COMPILER=/path/to/cuda/bin/nvcc
to cmake command.
$ mkdir build
$ cd build
$ cmake -DCMAKE_GENERATOR_PLATFORM=x64 ..
$ Open cublas_examples.sln project in Visual Studio and build
$ ./cublas_RotEx_example
Sample example output:
A
1.00 2.00 3.00 4.00
=====
B
5.00 6.00 7.00 8.00
=====
A
8.10 11.40 14.70 18.00
=====
B
9.30 10.20 11.10 12.00
=====