Multi-Granularity and Multi-modal Feature Interaction Approach for Text Video Retrieval.
Wenjun LiShudong WangDong ZhaoShenghui XuZhaoming PanZhimin ZhangPublished in: CoRR (2024)
Keyphrases
- multi modal
- video retrieval
- video search
- multi granularity
- concept based video retrieval
- video collections
- video database
- multi user
- semantic gap
- visual content
- video data
- dynamic integration
- audio visual
- content based retrieval
- multiple modalities
- video clips
- information retrieval
- image annotation
- retrieval systems
- human computer interaction
- semantic concepts
- keywords
- video content
- key frames
- video shots
- feature vectors
- high dimensional
- location aware
- image search
- video streams
- multimedia