Towards Comprehensive Multimodal Perception: Introducing the Touch-Language-Vision Dataset.
Ning ChengYou LiJing GaoBin FangJinan XuWenjuan HanPublished in: CoRR (2024)
Keyphrases
- visual perception
- vision system
- computer vision
- programming language
- real time
- machine intelligence
- visual processing
- language learning
- benchmark datasets
- color vision
- multi modal
- natural language
- multimedia
- image processing
- natural language processing
- synthetic datasets
- real world
- database
- human computer interaction
- feature selection
- language processing
- object oriented programming
- neural network
- multimodal interfaces