Skip to content

If an FFT is the only thing you are gonna do with your data, you may as well just do it on your CPU...

Notifications You must be signed in to change notification settings

AnthonyYao7/cuda_fft_is_slow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

cuda_fft_is_slow

If an FFT is the only thing you are gonna do with your data, you may as well just do it on your CPU...

This is the lesson I learned while coding up this GPU based FFT algorithm. The actual processing is, as you would expect, incredibly fast. But it accounts for only about 1% of 1% of the entire runtime of the whole GPU FFT procedure. The remainder of the time is spent copying the data of interest into VRAM from DRAM. Moral? Do more than just an FFT with your data.

Benchmarking

2^24 point FFT

Alg Time
GPU 547 ms
GPU (excluding memory ops) 67 us
CPU (numpy) 971 ms
CPU (mine) 17 s

About

If an FFT is the only thing you are gonna do with your data, you may as well just do it on your CPU...

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published