Multi-level network based on transformer encoder for fine-grained image-text matching.

Lei Yang, Yong Feng, Mingliang Zhou, Xiancai Xiong, Yongheng Wang, Baohua Qiang
Published in: Multim. Syst. (2023)
Keyphrases