WISE: Word-Level Interaction-Based Multimodal Fusion for Speech Emotion Recognition.
Guang ShenRiwei LaiRui ChenYu ZhangKejia ZhangQilong HanHongtao SongPublished in: INTERSPEECH (2020)
Keyphrases
- word level
- speech emotion recognition
- multimodal fusion
- multimodal interfaces
- language independent
- machine translation
- document images
- document analysis
- n gram
- human computer interaction
- audio visual
- high robustness
- character recognition
- machine learning
- information seeking
- multimodal interaction
- text mining
- relevance feedback
- feature extraction
- artificial intelligence