Multimodal Emotion Recognition Based on Deep Temporal Features Using Cross-Modal Transformer and Self-Attention.
Bubai MajiMonorama SwainRajlakshmi GuhaAurobinda RoutrayPublished in: ICASSP (2023)
Keyphrases
- cross modal
- emotion recognition
- multi modal
- audio visual
- visual data
- facial expressions
- sentiment analysis
- multimedia retrieval
- image retrieval
- visual recognition
- multimedia databases
- visual attention
- emotional state
- human computer interaction
- high dimensional
- data analysis
- visual similarity
- facial images
- multimedia
- text mining
- keywords