Leveraging Semantic Information for Efficient Self-Supervised Emotion Recognition with Audio-Textual Distilled Models.
Danilo de OliveiraNavin Raj PrabhuTimo GerkmannPublished in: INTERSPEECH (2023)
Keyphrases
- semantic information
- emotion recognition
- keywords
- audio visual
- semantic analysis
- visual information
- low level features
- domain knowledge
- wordnet
- multimedia
- emotional speech
- semantic features
- information fusion
- low level
- contextual information
- high level
- image representation
- visual data
- databases
- expert systems
- face recognition
- search engine
- artificial intelligence