Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks With Auditory Front-Ends.
Zhichao Peng, Xingfeng Li, Zhi Zhu, Masashi Unoki, Jianwu Dang, Masato Akagi
Published in: IEEE Access (2020)
Keyphrases
- recurrent networks
- emotion recognition
- recurrent neural networks
- text to speech synthesis
- emotional state
- biologically inspired
- neural network
- speech recognition
- feed forward
- speech signal
- facial expressions
- text to speech
- audio visual
- automatic speech recognition
- sliding window
- information processing
- emotional speech
- cross modal
- speech synthesis
- signal processing
- focus of attention
- human motion
- spoken language
- recognition engine