Leveraging Semantic Scene Characteristics and Multi-Stream Convolutional Architectures in a Contextual Approach for Video-Based Visual Emotion Recognition in the Wild.
Ioannis PikoulisPanagiotis Paraskevas FilntisisPetros MaragosPublished in: CoRR (2021)
Keyphrases
- multi stream
- emotion recognition
- audio visual
- visual data
- visual information
- contextual information
- emotional speech
- visual features
- hidden markov models
- high level
- high dimensional
- facial expressions
- multi modal
- three dimensional
- human computer interaction
- semantic information
- video sequences
- low level
- natural language
- image sequences
- input image
- image regions
- sentiment analysis