Publication: Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core Aware Weight Pruning.