Transform, contrast and tell: Coherent entity-aware multi-image captioning.

Jingqiang Chen
Published in: Comput. Vis. Image Underst. (2024)
Keyphrases