Multimodal Emotion Recognition from Raw Audio with Sinc-convolution.
Xiaohui ZhangWenjie FuMangui LiangPublished in: CoRR (2024)
Keyphrases
- emotion recognition
- audio visual
- multi modal
- emotional speech
- visual information
- multimedia
- multi stream
- human computer interaction
- audio features
- speaker verification
- visual data
- sentiment analysis
- high level
- facial expressions
- information fusion
- facial images
- physiological signals
- emotion classification
- high dimensional