Optimizing Modular Multiplication for NVIDIA's Maxwell GPUs.
Niall Emmart, Justin Luitjens, Charles C. Weems, Cliff Woolley
Published in: ARITH (2016)
Keyphrases
- graphics processing units
- floating point
- graphics hardware
- gpu implementation
- graphics processors
- general purpose
- modular neural networks
- parallel processing
- modular structure
- high performance computing
- parallel computation
- arithmetic operations
- processing units
- parallel programming
- neural network
- massively parallel
- computing systems
- machine learning