MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval.
Weitong CaiJiabo HuangShaogang GongHailin JinYang LiuPublished in: CoRR (2024)
Keyphrases
- video data
- video streams
- video content
- video sequences
- video indexing
- content based indexing
- space time
- multimedia
- video frames
- event recognition
- video database
- video analysis
- video clips
- real time
- video retrieval
- video search
- event detection
- content based video retrieval
- video surveillance
- spatial and temporal
- multimedia databases
- video segmentation
- video images
- content based access
- content based video
- real time video
- video dataset
- audio visual content
- news video
- multimedia information
- multimedia documents
- video processing
- multimedia data
- image database
- relevance feedback
- spatio temporal
- search engine