LocVTP: Video-Text Pre-training for Temporal Localization.
Meng CaoTianyu YangJunwu WengCan ZhangJue WangYuexian ZouPublished in: ECCV (26) (2022)
Keyphrases
- temporal information
- spatial and temporal
- activity detection
- space time
- temporal expressions
- natural language descriptions
- video data
- video search
- temporal consistency
- video analysis
- video sequences
- spatial temporal
- text retrieval
- temporal coherence
- video streams
- multimedia
- video database
- text detection
- information retrieval
- temporal segmentation
- video frames
- spatio temporal
- real time
- multimedia documents
- video content
- temporal analysis
- temporal domain
- temporal structure
- closed captions
- multimedia search
- video summarization
- dynamic textures
- keywords
- multimedia data
- training process
- video segments
- temporal data
- temporal relationships
- temporal correlation
- temporal reasoning
- video retrieval
- news video
- text documents
- supervised learning
- training set
- temporal constraints
- video clips
- localization method
- moving objects
- temporal redundancy
- spatio temporally