Multimodal Multifaceted Music Emotion Recognition Based on Self-Attentive Fusion of Psychology-Inspired Symbolic and Acoustic Features.
Jiahao Zhao, Kazuyoshi Yoshii
Published in: APSIPA ASC (2023)
Keyphrases
- acoustic features
- music retrieval
- music information retrieval
- audio features
- speaker verification
- audio visual
- visual features
- speech signal
- automatic speech recognition
- multi modal
- cross correlation
- low level
- human computer interaction
- visual attention
- visual information
- machine learning
- noisy environments
- information retrieval systems
- feature vectors
- pattern recognition