High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs.
William S. MosesIvan R. IvanovJens DomkeToshio EndoJohannes DoerfertOleksandr ZinenkoPublished in: PPoPP (2023)
Keyphrases
- graphics processing units
- high level
- general purpose
- pc cluster
- parallel processing
- parallel computing
- parallel implementation
- gpu implementation
- parallel computation
- real time
- parallel programming
- computing systems
- scientific computing
- highly parallel
- low level
- graphics hardware
- processing units
- graphics processors
- massively parallel
- high performance computing
- compute unified device architecture
- cpu implementation
- optimization problems
- efficient implementation
- floating point
- programming language
- global optimization
- multithreading
- higher level
- distributed memory
- ibm sp
- parallel computers
- parallel architectures
- shared memory
- optimization algorithm
- level parallelism