Login / Signup
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning.
Mandela Patrick
Po-Yao Huang
Ishan Misra
Florian Metze
Andrea Vedaldi
Yuki M. Asano
João F. Henriques
Published in:
ICCV (2021)
Keyphrases
</>
space time
video representation
spatio temporal
cross modal
spatial and temporal
video sequences
object recognition
motion patterns
machine learning
computer vision
multimedia
video content
human actions