Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.
Yi Bin
Haoxuan Li
Yahui Xu
Xing Xu
Yang Yang
Heng Tao Shen
Published in:
ACM Multimedia (2023)
Keyphrases
cross modal
multi modal
multimedia retrieval
image retrieval
multimedia databases
visual recognition
data streams
perceptual information
content based retrieval
text retrieval
visual similarity
visual features
document retrieval