Multimodal Recognition of Valence, Arousal and Dominance via Late-Fusion of Text, Audio and Facial Expressions.
Fabrizio NunnariAnnette RiosUwe ReichelChirag BhuvaneshwaraPanagiotis Paraskevas FilntisisPetros MaragosFelix BurkhardtFlorian EybenBjörn W. SchullerSarah EblingPublished in: ESANN (2023)
Keyphrases
- facial expressions
- emotional state
- emotion recognition
- late fusion
- facial expression recognition
- recognition of facial expressions
- cross media
- multimedia
- facial action units
- facial actions
- audio visual
- human faces
- face images
- video sequences
- image retrieval
- face recognition
- multi modal
- visual features
- text mining
- object recognition
- action recognition
- feature extraction
- machine learning
- video frames
- wavelet transform
- keywords
- search engine