Login / Signup
DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning.
Mengfei Du
Binhao Wu
Jiwen Zhang
Zhihao Fan
Zejun Li
Ruipu Luo
Xuanjing Huang
Zhongyu Wei
Published in:
CoRR (2024)
Keyphrases
</>
learning algorithm
multi modal
visual recognition
cross modal
computer vision
learning tasks
e learning
information retrieval systems
image classification
higher level
contextual information
perceptual information