Login / Signup
Parameter Efficient Multimodal Transformers for Video Representation Learning.
Sangho Lee
Youngjae Yu
Gunhee Kim
Thomas M. Breuel
Jan Kautz
Yale Song
Published in:
CoRR (2020)
Keyphrases
</>
spatio temporal
prior knowledge
multimedia
video surveillance