A natural language processing-based approach: mapping human perception by understanding deep semantic features in street view images.
Haoran MaDongdong WuPublished in: CoRR (2023)
Keyphrases
- human perception
- street view
- semantic features
- natural language processing
- image data
- image features
- text detection
- input image
- image classification
- human visual system
- wordnet
- three dimensional
- image retrieval
- image database
- feature points
- information extraction
- machine learning
- image collections
- multiple images
- knowledge base
- search engine
- object recognition
- natural language
- face recognition
- computer vision
- information retrieval
- document clustering
- visual features
- knowledge representation
- spatio temporal
- multiscale