Speaker-aware cognitive network with cross-modal attention for multimodal emotion recognition in conversation.
Lili GuoYikang SongShifei DingPublished in: Knowl. Based Syst. (2024)
Keyphrases
- cross modal
- audio visual
- emotion recognition
- multi modal
- cognitive network
- visual data
- visual information
- high dimensional
- fuzzy cognitive maps
- image retrieval
- human computer interaction
- natural language
- feature selection
- facial expressions
- image regions
- information retrieval
- multimedia data
- sentiment analysis
- image annotation
- information fusion
- multimedia databases
- similarity measure
- computer vision