Optimizing Modular Multiplication for NVIDIA's Maxwell GPUs.

Niall Emmart, Justin Luitjens, Charles C. Weems, Cliff Woolley

Published in: ARITH (2016)

Keyphrases

graphics processing units
floating point
graphics hardware
gpu implementation
graphics processors
general purpose
modular neural networks
parallel processing
modular structure
high performance computing
parallel computation
arithmetic operations
processing units
parallel programming
neural network
massively parallel
computing systems
machine learning