A Performance Optimization Framework for Compilation of Tensor Contraction Expressions into Parallel Programs.
Gerald BaumgartnerDavid E. BernholdtDaniel CociorvaChi-Chung LamJ. RamanujamRobert J. HarrisonMarcel NooijenP. SadayappanPublished in: IPDPS (2002)