Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations.
Wen WuChao ZhangPhilip C. WoodlandPublished in: CoRR (2023)
Keyphrases
- speech recognition
- emotion recognition
- audio visual
- automatic speech recognition
- speaker verification
- emotional speech
- hidden markov models
- speaker identification
- conversational speech
- language model
- human computer interaction
- speaker independent
- facial expressions
- sentiment analysis
- speech synthesis
- pattern recognition
- speech recognizer
- speaker dependent
- emotional state
- information fusion
- speaker diarization
- speech recognition systems
- speech signal
- acoustic models
- speaker adaptation
- affective states
- noisy environments
- facial images
- multi modal
- low level
- neural network