Scalable multi-GPU 3-D FFT for TSUBAME 2.0 supercomputer.
Akira NukadaKento SatoSatoshi MatsuokaPublished in: SC (2012)
Keyphrases
- floating point
- massively parallel
- building blocks
- parallel computing
- real time
- graphics processing units
- frequency domain
- fourier transform
- parallel processing
- fast fourier transform
- artificial intelligence
- lightweight
- web scale
- efficient implementation
- spectral analysis
- parallel implementation
- real world
- neural network