Login / Signup
DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning.
Mengfei Du
Binhao Wu
Jiwen Zhang
Zhihao Fan
Zejun Li
Ruipu Luo
Xuanjing Huang
Zhongyu Wei
Published in:
LREC/COLING (2024)
Keyphrases
</>
cross modal
information retrieval
learning algorithm
computer vision
image features
image database
semantic information