Efficient compilation of CUDA kernels for high-performance computing on FPGAs.
Alexandros Papakonstantinou, Karthik Gururaj, John A. Stratton, Deming Chen, Jason Cong, Wen-mei W. Hwu
Published in: ACM Trans. Embed. Comput. Syst. (2013)
Keyphrases
- high performance computing
- parallel computing
- scientific computing
- hardware software
- massively parallel
- computational science
- general purpose
- computing environments
- databases
- parallel architectures
- field programmable gate array
- cost effective
- fine grained
- parallel implementation
- real time
- grid computing
- efficient implementation
- fault tolerance
- parallel programming
- software engineering