Improving Performance of Floating Point Division on GPU and MIC.
Kun HuangYifeng ChenPublished in: ICA3PP (2) (2015)
Keyphrases
- floating point
- graphics processing units
- memory bandwidth
- fixed point
- gpu implementation
- graphics hardware
- real time
- square root
- parallel computation
- sparse matrices
- parallel implementation
- parallel computing
- instruction set
- parallel processing
- parallel programming
- fast fourier transform
- interval arithmetic
- state space
- parallel algorithm
- data processing
- image processing