A multimodal hierarchical approach to speech emotion recognition from audio and text.
Prabhav SinghRidam SrivastavaK. P. S. RanaVineet KumarPublished in: Knowl. Based Syst. (2021)
Keyphrases
- speech emotion recognition
- text graphics
- audio visual
- multimedia
- cross modal
- multi modal
- multimodal fusion
- signal processing
- human language
- information retrieval
- multimodal information
- hierarchical structure
- story segmentation
- text data
- keywords
- visual information
- text to speech
- text retrieval
- audio content
- free text
- cross media retrieval
- news video
- digital video
- textual information
- single modality
- hierarchical clustering
- visual data
- relevance feedback
- database