Multi-cultural speech emotion recognition using language and speaker cues.
Sandeep Kumar Pandey, Hanumant Singh Shekhawat, S. R. M. Prasanna
Published in: Biomed. Signal Process. Control. (2023)
Keyphrases
- text to speech synthesis
- prosodic features
- text to speech
- audio visual
- speech recognition
- speech synthesis
- speaker verification
- multimodal fusion
- emotion recognition
- speaker recognition
- automatic speech recognition
- language acquisition
- speaker diarization
- english text
- speaker identification
- language learning
- programming language
- facial expressions
- spoken language
- natural language
- speech signal
- emotional speech
- speaker dependent
- broadcast news
- multi modal
- speech recognizer
- cross cultural
- language generation
- language processing
- emotional state
- visual cues
- automatic transcription
- automatic speech recognition systems