Sign in

Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions.

Qingda LuXiaoyang GaoSriram KrishnamoorthyGerald BaumgartnerJ. RamanujamP. Sadayappan
Published in: J. Parallel Distributed Comput. (2012)
Keyphrases