Login / Signup

Multi-entity Video Transformers for Fine-Grained Video Representation Learning.

Matthew WalmerRose Catherine KanjirathinkalKai Sheng TaiKeyur MuzumdarTai-Peng TianAbhinav Shrivastava
Published in: CoRR (2023)
Keyphrases
  • fine grained
  • video representation
  • coarse grained
  • access control
  • spatio temporal
  • video data
  • video sequences
  • key frames
  • video analysis
  • search engine
  • prior knowledge
  • video streams