Deep Learning for the Detection of Emotion in Human Speech: The Impact of Audio Sample Duration and English versus Italian Languages.
Alexander WurstMichael W. HopwoodSifan WuFei LiYu-Dong YaoPublished in: WOCC (2023)
Keyphrases
- deep learning
- emotion recognition
- english text
- broadcast news
- text to speech
- emotional state
- spoken language
- language identification
- speaker identification
- audio visual
- audio stream
- cross lingual
- target language
- unsupervised learning
- unsupervised feature learning
- machine translation
- speech recognition
- facial expressions
- automatic speech recognition
- weakly supervised
- object detection
- natural language
- machine learning
- mental models
- cross language
- active learning
- computer vision