Self-supervised Cross-modal Pretraining for Speech Emotion Recognition and Sentiment Analysis.
Iek-Heng Chu, Ziyi Chen, Xinlu Yu, Mei Han, Jing Xiao, Peng Chang
Published in: EMNLP (Findings) (2022)
Keyphrases
- cross-modal
- sentiment analysis
- speech emotion recognition
- multimodal
- opinion mining
- text classification
- sentiment classification
- sentence level
- visual recognition
- image retrieval
- text mining
- multimedia retrieval
- natural language processing
- product features
- multimedia databases
- high dimensional
- sentiment lexicon
- product reviews
- visual similarity
- knowledge representation
- visual data
- machine learning
- image understanding
- information extraction
- multimedia