Login / Signup
Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment.
Rohan Pandey
Rulin Shao
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
Published in:
CoRR (2022)
Keyphrases
</>
cross modal
multi modal
visual recognition
image retrieval
computer vision
multimedia retrieval
natural language
multimedia databases
feature selection
co occurrence
image classification
data processing
visual data