Multi-Source Transformer Architectures for Audiovisual Scene Classification.
Wim BoesHugo Van hammePublished in: CoRR (2022)
Keyphrases
- multi source
- scene classification
- object recognition
- data fusion
- image classification
- biologically inspired
- information fusion
- indoor outdoor
- natural scenes
- data integration
- multiple sources
- visual words
- image representation
- fuzzy logic
- data sources
- neural network
- bag of features
- visual information
- computer vision
- machine learning
- multi modal
- data model
- natural images
- higher order
- d objects