Interpretable Multimodal Emotion Recognition using Hybrid Fusion of Speech and Image Data.
Puneet KumarSarthak MalikBalasubramanian RamanPublished in: CoRR (2022)
Keyphrases
- multimodal fusion
- audio visual
- image data
- multimodal interfaces
- emotion recognition
- human computer interaction
- visual data
- multi modal
- high robustness
- text to speech synthesis
- multimodal interaction
- learning mechanism
- speech recognition
- emotional state
- emotional speech
- data fusion
- visual information
- multi stream
- relevance feedback
- range data
- speech synthesis
- user interface
- speaker verification
- video sequences
- endpoint detection
- text to speech
- fusion algorithm
- raw data
- multispectral
- speech signal
- multi sensor
- information fusion
- multimodal biometrics
- virtual environment
- image database
- multimedia
- multimodal medical images