Multimodal speech emotion recognition based on multi-scale MFCCs and multi-view attention mechanism.
Lin FengLu-Yao LiuSheng-lan LiuJian ZhouHan-Qing YangJie YangPublished in: Multim. Tools Appl. (2023)
Keyphrases
- multi view
- speech emotion recognition
- attention mechanism
- multiscale
- visual attention
- multiple views
- single view
- saliency map
- depth map
- three dimensional
- natural images
- visual attention model
- d objects
- multi modal
- semi supervised
- image representation
- wavelet transform
- coarse to fine
- image processing
- multimedia
- multi view learning
- learning algorithm
- eye tracking
- multi view images
- multiple cameras
- visual features
- audio visual
- high resolution
- object recognition
- keywords
- computer vision