Sign in

Fine-Grained Visual Textual Alignment for Cross-Modal Retrieval Using Transformer Encoders.

Nicola MessinaGiuseppe AmatoAndrea EsuliFabrizio FalchiClaudio GennaroStéphane Marchand-Maillet
Published in: ACM Trans. Multim. Comput. Commun. Appl. (2021)
Keyphrases