Masking Modalities for Cross-modal Video Retrieval.
Valentin GabeurArsha NagraniChen SunKarteek AlahariCordelia SchmidPublished in: CoRR (2021)
Keyphrases
- video retrieval
- cross modal
- multi modal
- content based retrieval
- multimedia retrieval
- visual content
- visual similarity
- semantic gap
- visual data
- video data
- visual recognition
- video search
- multimedia databases
- video content
- retrieval systems
- image retrieval
- key frames
- video clips
- search engine
- high dimensional
- semantic concepts
- multimedia data
- visual information
- contextual information
- active learning