Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation.
Yibo CuiLiang XieYakun ZhangMeishan ZhangYe YanErwei YinPublished in: ICCV (2023)
Keyphrases
- natural language
- programming language
- training set
- computer vision
- supervised learning
- face recognition
- artificial neural networks
- image registration
- language processing
- training samples
- training examples
- training phase
- named entities
- pattern languages
- real time
- modeling language
- training process
- test set
- online learning
- vision system