LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation.

Published in: CoRR (2021)

Keyphrases