Publication: EgoViT: Pyramid Video Transformer for Egocentric Action Recognition.