Login / Signup
Multi-entity Video Transformers for Fine-Grained Video Representation Learning.
Matthew Walmer
Rose Catherine Kanjirathinkal
Kai Sheng Tai
Keyur Muzumdar
Tai-Peng Tian
Abhinav Shrivastava
Published in:
CoRR (2023)
Keyphrases
</>
fine grained
video representation
coarse grained
access control
spatio temporal
video data
video sequences
key frames
video analysis
search engine
prior knowledge
video streams