Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.
Yi BinHaoxuan LiYahui XuXing XuYang YangHeng Tao ShenPublished in: CoRR (2023)
Keyphrases
- cross modal
- multi modal
- multimedia retrieval
- image retrieval
- multimedia databases
- data streams
- visual similarity
- perceptual information
- visual recognition
- visual data
- multimedia information retrieval
- multimedia
- content based retrieval
- information retrieval
- text retrieval
- multimedia data
- retrieval systems
- information retrieval systems