Conv_CUDA

A bare bones ML framework that mimics how PyTorch handles neural network architectures. This was implemented with custom CUDA kernels that will be profiled and optimized using Nsight Compute Systems profiler from Nvidia.

Build Instructions:

Within the data/mnist/ directory there is a bash script to download the MNIST dataset.

git clone --recursive <repo_url>

./install_mnist.sh

Then from the project root:

mkdir build && cd build
cmake ..
make

From project root, to run the training loop run:

./build/my_conv_app

It should work, and you can now try to change up the current network pipeline that is in the main.cu file.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
src		src
tests		tests
third_party		third_party
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Conv_CUDA

Build Instructions:

Within the data/mnist/ directory there is a bash script to download the MNIST dataset.

From project root, to run the training loop run:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Conv_CUDA

Build Instructions:

Within the data/mnist/ directory there is a bash script to download the MNIST dataset.

From project root, to run the training loop run:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages