ECE 60827 CUDA Programming Part 4

Professor T. N. Vijaykumar
TA: Ni Kang

Due on Mar 09, 2024

Introduction

The directions for this part are simple: take the computation kernels you wrote in the first three labs and combine them into an implementation of AlexNet. As a reminder, the architecture of AlexNet [1] is as follows.
[Figure: AlexNet architecture]
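
When chaining the kernels together, it can help to check the tensor shape each layer should produce before allocating device buffers. The sketch below is not part of the assignment. It assumes the commonly quoted AlexNet layer parameters (227x227x3 input, five convolutions, three 3x3 max-pool stages with stride 2); if the architecture figure for this lab lists different values, the figure takes precedence. The names `Layer`, `out_dim`, and `alexnet_feature_count` are hypothetical helpers introduced only for illustration.

```cpp
#include <cassert>
#include <cstdio>

// Spatial output size of a convolution or pooling window:
// floor((in + 2 * pad - kernel) / stride) + 1
int out_dim(int in, int kernel, int stride, int pad) {
    assert(stride > 0 && in + 2 * pad >= kernel);
    return (in + 2 * pad - kernel) / stride + 1;
}

// One convolution or pooling stage (hypothetical helper type).
struct Layer {
    const char* name;
    int kernel, stride, pad, out_channels;
};

// Walks the assumed AlexNet feature extractor and returns the number of
// values per image that feed the first fully connected layer.
int alexnet_feature_count(int input_dim, bool verbose) {
    // Assumed parameters; check them against the lab's architecture figure.
    const Layer layers[] = {
        {"conv1", 11, 4, 0,  96},
        {"pool1",  3, 2, 0,  96},
        {"conv2",  5, 1, 2, 256},
        {"pool2",  3, 2, 0, 256},
        {"conv3",  3, 1, 1, 384},
        {"conv4",  3, 1, 1, 384},
        {"conv5",  3, 1, 1, 256},
        {"pool5",  3, 2, 0, 256},
    };
    int dim = input_dim;
    int channels = 3;  // RGB input
    for (const Layer& l : layers) {
        dim = out_dim(dim, l.kernel, l.stride, l.pad);
        channels = l.out_channels;
        if (verbose) {
            std::printf("%-6s -> %d x %d x %d\n", l.name, dim, dim, channels);
        }
    }
    return dim * dim * channels;
}
```

Calling `alexnet_feature_count(227, true)` prints one line per layer and returns the flattened size expected by the first fully connected layer under these assumptions. Multiplying each per-image size by the batch size gives the element count for the corresponding device buffer.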

Optimize the full AlexNet flow as much as you can. Validating the output against a CPU implementation is impractical because the CPU run would take far too long, so simply measure the full AlexNet execution time at several batch sizes and report the results.
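
One way to collect these measurements is a small host-side harness that runs the full forward pass at each batch size and averages over several repetitions. The sketch below is an illustration, not a required structure: `time_batches`, `TimingResult`, and the `run_batch` callable are hypothetical names. It assumes `run_batch(batch_size)` executes one complete forward pass and blocks until the GPU has finished (for example by calling `cudaDeviceSynchronize()` before returning), since kernel launches are asynchronous and wall-clock timing is otherwise meaningless. Timing with CUDA events (`cudaEventRecord`, `cudaEventElapsedTime`) is a reasonable alternative.

```cpp
#include <cassert>
#include <chrono>
#include <functional>
#include <vector>

// Timing summary for one batch size (hypothetical helper type).
struct TimingResult {
    int batch_size;
    double avg_ms;          // average wall-clock time per forward pass
    double images_per_sec;  // throughput derived from avg_ms
};

// Times run_batch at each batch size. run_batch is assumed to run the whole
// network once for the given batch size and to return only after the GPU
// work has completed.
std::vector<TimingResult> time_batches(
        const std::vector<int>& batch_sizes,
        int repeats,
        const std::function<void(int)>& run_batch) {
    assert(repeats > 0);
    std::vector<TimingResult> results;
    for (int batch : batch_sizes) {
        run_batch(batch);  // untimed warm-up pass

        auto start = std::chrono::steady_clock::now();
        for (int r = 0; r < repeats; ++r) {
            run_batch(batch);
        }
        auto stop = std::chrono::steady_clock::now();

        double total_ms =
            std::chrono::duration<double, std::milli>(stop - start).count();
        double avg_ms = total_ms / repeats;
        double images_per_sec = avg_ms > 0.0 ? batch * 1000.0 / avg_ms : 0.0;
        results.push_back({batch, avg_ms, images_per_sec});
    }
    return results;
}
```

A call such as `time_batches({1, 8, 64}, 10, run_alexnet)` (with `run_alexnet` standing in for your own forward-pass function) yields one row per batch size for the report. The warm-up pass keeps one-time costs such as context creation out of the average.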

Completing this will earn you serious street cred: you can now claim that you implemented AlexNet from scratch in CUDA :)

References

[1] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In Proceedings of the 25th International Conference on Neural Information Processing Systems - Volume 1 (NIPS'12). Curran Associates Inc., Red Hook, NY, USA, 1097–1105.
