cuThomasBatch and cuThomasVBatch, CUDA Routines to compute batch of tridiagonal systems on NVIDIA GPUs.
Pedro Valero-LaraIvan Martínez-PérezRaül SirventXavier MartorellAntonio J. PeñaPublished in: Concurr. Comput. Pract. Exp. (2018)
Keyphrases
- floating point
- gpu implementation
- general purpose
- data sets
- graphics hardware
- intelligent systems
- graphics processors
- linear systems
- embedded systems
- machine learning
- computer systems
- knowledge based systems
- building blocks
- complex systems
- distributed systems
- parallel implementation
- management system
- parallel computing
- active learning
- parallel computation
- neural network
- real time