Efficient Fork-Join on GPUs Through Warp Specialization.
Arpith Chacko JacobAlexandre E. EichenbergerHyojin SungSamuel F. AntãoGheorghe-Teodor BerceaCarlo BertolliAlexey BataevTian JinTong ChenZehra SuraGeorgios RokosKevin O'BrienPublished in: HiPC (2017)