Login / Signup
Zero and R2D2: A Large-scale Chinese Cross-modal Benchmark and A Vision-Language Framework.
Chunyu Xie
Heng Cai
Jianfei Song
Jincheng Li
Fanjing Kong
Xiaoyu Wu
Henrique Morimitsu
Lin Yao
Dexin Wang
Dawei Leng
Xiangyang Ji
Yafeng Deng
Published in:
CoRR (2022)
Keyphrases
</>
cross modal
multi modal
visual features
computer vision
nearest neighbor
information retrieval systems