X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval.
Satya Krishna GortiNoël VouitsisJunwei MaKeyvan GolestanMaksims VolkovsAnimesh GargGuang Wei YuPublished in: CoRR (2022)
Keyphrases
- video retrieval
- cross modal
- video search
- video segments
- video collections
- multi modal
- video database
- concept based video retrieval
- video data
- visual content
- video content
- content based retrieval
- multimedia retrieval
- video shots
- key frames
- video clips
- visual similarity
- content based video retrieval
- semantic gap
- visual data
- text retrieval
- news video
- retrieval systems
- image retrieval
- multimedia documents
- semantic concepts
- semantic content
- text mining
- video streams
- semantic video retrieval
- text documents