Reflection Alpaka Update #111

erikzenker
Member

This PR introduces correct reflection calculations within alpaka kernels.

Sorry for the huge pull request! I can't rebase because of subtree issues. The code was tested in #110.
It is still missing:

  • a nice random number generator interface
  • a fix for the parallel modes mpi and graybat, which both recreate mesh information again and again, costing a lot of runtime
  • a CMakeLists.txt that can also compile for the CUDA target

erikzenker and others added 18 commits September 15, 2015 14:48
9dfdb96 fix missing OpenMP link flag
b9f099c fix foldrAll ICC bug
83ddac5 disable the OpenMP 4 back-end by default
8644064 fix Vec for Intel
819e5d9 fix boost 1.56 missing const bug
f9cd663 really fix Intel cpuid
330d983 remove incorrect docu
9f1b692 fix Intel compiler cpuid
1aa4c86 fix missing OMP_NUM_THREADS reset in getMaxOmpThreads
328e866 fix CUDA compilation
33c7888 remove ICC from the readme (untested / not compiling)
40a8465 always interpret all source files as .cu files for nvcc
25f4670 allow vectorize to be called without the element type
882c0a9 enhance documentation
05454a6 fix ambiguous template specialization for GetWorkDiv
5b70326 remove call to std::ref in BlockSharedAllocCudaBuiltIn
e15c40a fix fix AtomicOmpCritSec
afffe2f fix wrong atomic implementation for AccCpuOmp2Blocks
2a60bbb fix BufCudaRt destruction
062378d add ALPAKA_ADD_EXECUTABLE to alpakaConfig.cmake
b9a4125 use DimInt more consistently
919dc26 move ElemType from mem::view to elem
2807fc8 add initial ALPAKA_ADD_EXECUTABLE
f019e70 fix BufPlainPtrWrapper pitch
1ca1923 fix missing OpenMP linker flag
9ee231d fix getFreeGlobalMemSizeBytes
7e853c6 Merge pull request ComputationalRadiationPhysics#54 from psychocoderHPC/fix-cudaSet
6796eff Merge pull request ComputationalRadiationPhysics#55 from psychocoderHPC/fix-callingHostFunctionFromDevice
9f3d8e6 fix warning calling host function from device
000a250 fix wrong usage of `getPitchBytes<>()`
8be955d Merge pull request ComputationalRadiationPhysics#53 from psychocoderHPC/topic-suppressHostDeviceWarning
b7c877d Merge pull request ComputationalRadiationPhysics#52 from psychocoderHPC/tpoic-updateGitIgnore
0b94251 suppress host device warning
33a59be update `.gitignore`
237898f refactoring
d0ad945 implement getFreeGlobalMemSizeBytes
f85e233 allow accelerators to inherit from rand implementation
d96e8b5 fix CUDA set implemenentation

git-subtree-dir: include/alpaka
git-subtree-split: 9dfdb96b0cb2fc32a1f2e447de755905f7538bf4
@erikzenker erikzenker changed the title from "Reflection Alpaka UThispdate" to "Reflection Alpaka Update" Oct 8, 2015
@slizzered
Contributor

computer says

@erikzenker
Member Author

Success !!! 🐢

@slizzered slizzered added this to the 1.6 HASEonALPAKA milestone Oct 8, 2015
@slizzered
Contributor

Wow, that diff is big enough to overwhelm the GitHub diff functionality...
Next time, we should really think about creating a new branch with the cherry-picked and squashed commits and adding the subtree squash only at the end. I'll merge it for now, since this isn't dev and you did the evaluation in #110 (very nice evaluation, by the way! 🐱)

I guess there needs to be a proper code-review after all the alpaka-stuff is done.

Thanks for all the effort, the results look great!

slizzered added a commit that referenced this pull request Oct 8, 2015
@slizzered slizzered merged commit b5febd4 into ComputationalRadiationPhysics:topic-alpaka Oct 8, 2015