Fusing Visual Attention CNN and Bag of Visual Words for Cross-Corpus Speech Emotion Recognition.
Minji Seo, Myungho Kim
Published in: Sensors (2020)
Keyphrases
- visual attention
- bag of visual words
- speech emotion recognition
- visual words
- image representation
- saliency map
- image classification
- image descriptors
- eye tracking
- eye movements
- visual content
- action recognition
- vision system
- bag of words
- natural scenes
- higher level
- co-occurrence
- multiscale
- probabilistic latent semantic analysis
- visual features
- spatial information
- image retrieval
- image content
- higher order
- visual information
- texture analysis
- video retrieval
- feature space
- object recognition
- machine learning
- image processing