A Survey of Deep Learning-Based Multimodal Emotion Recognition: Speech, Text, and Face.
Hailun Lian, Cheng Lu, Sunan Li, Yan Zhao, Chuangao Tang, Yuan Zong
Published in: Entropy (2023)
Keyphrases
- emotion recognition
- deep learning
- audio visual
- facial expressions
- facial images
- emotional speech
- multi stream
- multi modal
- unsupervised learning
- human faces
- emotion classification
- human computer interaction
- machine learning
- face images
- information fusion
- visual information
- sentiment analysis
- text mining
- emotional state
- facial features
- multimedia
- visual data
- mental models
- information retrieval
- speech recognition
- affective states
- semantic information
- knowledge discovery
- video sequences
- sentence level
- face recognition
- computer vision