Multimodal Speech Emotion Recognition Using Modality-Specific Self-Supervised Frameworks.
Rutherford Agbeshi Patamia, Paulo E. Santos, Kingsley Nketia Acheampong, Favour Ekong, Kwabena Sarpong, She Kun
Published in: SMC (2023)
Keyphrases
- multi modal
- audio visual
- multimodal interfaces
- emotion recognition
- multimodal fusion
- emotional state
- multimedia
- multimodal interaction
- speech recognition
- medical images
- higher level
- language acquisition
- spoken language
- high level
- human computer interaction
- domain specific
- video sequences
- multi stream
- speech synthesis
- data sets
- emotional speech