Bridging Modalities: Knowledge Distillation and Masked Training for Translating Multi-Modal Emotion Recognition to Uni-Modal, Speech-Only Emotion Recognition.
Muhammad MuazNathan PaullJahnavi MalagavalliPublished in: CoRR (2024)
Keyphrases
- emotion recognition
- multi modal
- audio visual
- uni modal
- emotional speech
- cross modal
- single modality
- multiple modalities
- speaker verification
- facial expressions
- multi modality
- video search
- visual information
- human computer interaction
- emotion classification
- high dimensional
- information retrieval
- emotional state
- audio features
- information fusion
- low level