Interpretable multimodal emotion recognition using hybrid fusion of speech and image data.
Puneet KumarSarthak MalikBalasubramanian RamanPublished in: Multim. Tools Appl. (2024)
Keyphrases
- multimodal fusion
- audio visual
- image data
- multimodal interfaces
- emotion recognition
- human computer interaction
- multi modal
- multimodal interaction
- visual data
- high robustness
- multi stream
- text to speech synthesis
- relevance feedback
- emotional state
- learning mechanism
- image quality
- data fusion
- information fusion
- multimodal biometrics
- multimedia
- image database
- speech recognition
- visual information
- image content
- emotional speech
- affect detection
- mr images
- hyperspectral
- speaker verification
- recognition engine
- facial expressions
- automatic speech recognition
- emotion classification
- multispectral
- classification rules
- spoken language
- fusion algorithm
- multi sensor
- speech signal