Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation.
Yibo CuiLiang XieYakun ZhangMeishan ZhangYe YanErwei YinPublished in: CoRR (2023)
Keyphrases
- computer vision
- real time
- natural language
- vision system
- programming language
- entity identification
- training algorithm
- data sets
- coreference resolution
- specification language
- training phase
- training process
- test set
- natural language processing
- supervised learning
- language learning
- training set
- training data
- high level
- computational linguistics
- image processing
- e learning
- neural network