Publication: Shrinking Temporal Attention in Transformers for Video Action Recognition.