An Efficient Vectorization Approach to Nested Thread-level Parallelism for CUDA GPUs.

Shixiong Xu, David Gregg
Published in: PACT (2015)
Keyphrases