Publication: Duplo: Lifting Redundant Memory Accesses of Deep Neural Networks for GPU Tensor Cores.