Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment.
Rohan Pandey
Rulin Shao
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
Published in:
ACL (1) (2023)
Keyphrases
cross-modal
multi-modal
multimedia retrieval
computer vision
image retrieval
visual recognition
feature extraction
perceptual information
natural language
co-occurrence
image classification
multimedia data
multimedia databases
visual data