Enhancing the Programmability and Performance Portability of GPU Tensor Operations.

Arya MazaheriJohannes SchulteMatthew W. MoskewiczFelix WolfAli Jannesari
Published in: Euro-Par (2019)
Keyphrases
  • high order
  • higher order
  • real time
  • graphics hardware
  • information systems
  • parallel computation
  • image processing
  • multiscale
  • gpu implementation