-RAAP: A Multi-Modal Recipe for Advancing Adaptation-based Pre-training towards Effective and Efficient Zero-shot Video-text Retrieval.
Xingning DongZipeng FengChunluan ZhouXuzheng YuMing YangQingpei GuoPublished in: SIGIR (2024)
Keyphrases
- multi modal
- text retrieval
- video search
- semantic concepts
- audio visual
- document collections
- document retrieval
- image retrieval
- inverted file
- video data
- multi modality
- retrieval systems
- query expansion
- high dimensional
- multimedia information retrieval
- multiple modalities
- information retrieval
- multimedia retrieval
- uni modal
- visual data
- video sequences
- multimedia
- learning algorithm
- high level