SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding.
Baoxiong Jia, Yixin Chen, Huangyue Yu, Yan Wang, Xuesong Niu, Tengyu Liu, Qing Li, Siyuan Huang
Published in: CoRR (2024)
Keyphrases
- language learning
- scene understanding
- vision system
- object detection
- object recognition
- computer vision
- scene recognition
- 3d scene
- video surveillance
- computer assisted language learning
- mobile learning
- real time
- language acquisition
- english language
- foreign language
- scene categorization
- english learning
- vocabulary learning
- mobile language learning
- learning tools
- semi supervised
- language learners
- three dimensional