Login / Signup
Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning.
Mandela Patrick
Yuki Markus Asano
Bernie Huang
Ishan Misra
Florian Metze
João F. Henriques
Andrea Vedaldi
Published in:
CoRR (2021)
Keyphrases
</>
space time
video representation
spatio temporal
spatial and temporal
cross modal
video sequences
multi modal
machine learning
high dimensional
dynamic scenes
video synopsis
generative model