GPU code addition. #23

Open
wants to merge 7 commits into
base: develop

karanchahal
Owner

This pull request adds support for building the project for the GPU. So far, only the CUDA kernel for the dot product (matrix multiplication) has been added.
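For reviewers who have not seen the pattern before, here is a rough sketch of what a dot product (matrix multiplication) kernel can look like next to a CPU reference. It is not the code in this pull request: the names `matmulCpu`, `matmulKernel` and `matmulGpu`, the row-major layout, and the 16x16 block size are all assumptions made for illustration. The GPU part sits behind `__CUDACC__`, which nvcc defines, so the snippet also compiles with a plain C++ compiler.

```cpp
// Illustrative sketch only. All names here are hypothetical and are not
// taken from this pull request.
#include <cassert>
#include <cstddef>
#include <vector>

#ifdef __CUDACC__
#include <cuda_runtime.h>
#endif

// CPU reference: C (m x n) = A (m x k) * B (k x n), all stored row-major.
std::vector<float> matmulCpu(const std::vector<float>& a,
                             const std::vector<float>& b,
                             std::size_t m, std::size_t k, std::size_t n) {
    assert(a.size() == m * k && b.size() == k * n);
    std::vector<float> c(m * n, 0.0f);
    for (std::size_t row = 0; row < m; ++row) {
        for (std::size_t col = 0; col < n; ++col) {
            float sum = 0.0f;
            for (std::size_t i = 0; i < k; ++i) {
                sum += a[row * k + i] * b[i * n + col];
            }
            c[row * n + col] = sum;
        }
    }
    return c;
}

#ifdef __CUDACC__  // compiled only by nvcc, e.g. from a .cu source file

// Each thread computes one element of C.
__global__ void matmulKernel(const float* a, const float* b, float* c,
                             int m, int k, int n) {
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row < m && col < n) {  // guard threads that fall outside the matrix
        float sum = 0.0f;
        for (int i = 0; i < k; ++i) {
            sum += a[row * k + i] * b[i * n + col];
        }
        c[row * n + col] = sum;
    }
}

// Host wrapper: copy inputs to the device, launch the kernel, copy C back.
// Error checking is omitted to keep the sketch short.
std::vector<float> matmulGpu(const std::vector<float>& a,
                             const std::vector<float>& b,
                             int m, int k, int n) {
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    std::size_t bytesC = static_cast<std::size_t>(m) * n * sizeof(float);
    cudaMalloc(&dA, a.size() * sizeof(float));
    cudaMalloc(&dB, b.size() * sizeof(float));
    cudaMalloc(&dC, bytesC);
    cudaMemcpy(dA, a.data(), a.size() * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(dB, b.data(), b.size() * sizeof(float), cudaMemcpyHostToDevice);

    dim3 block(16, 16);  // assumed block size, not tuned
    dim3 grid((n + block.x - 1) / block.x, (m + block.y - 1) / block.y);
    matmulKernel<<<grid, block>>>(dA, dB, dC, m, k, n);

    std::vector<float> c(static_cast<std::size_t>(m) * n);
    cudaMemcpy(c.data(), dC, bytesC, cudaMemcpyDeviceToHost);
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
    return c;
}

#endif  // __CUDACC__
```

Keeping a CPU reference like `matmulCpu` next to the kernel gives a simple way to check the GPU result on small inputs, for example by comparing `matmulGpu(a, b, 2, 2, 2)` against `matmulCpu(a, b, 2, 2, 2)` element by element.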

To build for the GPU, the CUDA compiler takes main.cu as the source file. main.cu includes buildTensorflowGpu.h instead of buildTensorflow.h, which main.cpp includes for the CPU build.

buildTensorflowGpu.h pulls the CUDA kernel code into the project by including gpu/defn.h and gpu/impl.h. These two files contain the kernel definitions and implementations, respectively.

Finally, to build for the CPU, run make all; to build for the GPU, run make gpu.

@karanchahal karanchahal requested a review from uditarora April 10, 2019 19:57
@karanchahal karanchahal added the cuda Items related to code that runs on the GPU via CUDA label Apr 10, 2019
Labels
cuda Items related to code that runs on the GPU via CUDA