MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition.

Published in: CoRR (2022)

Keyphrases