Login / Signup

A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation.

Jeremy GwinnupKevin Duh
Published in: CoRR (2023)
Keyphrases