Fusion of mel and gammatone frequency cepstral coefficients for speech emotion recognition using deep C-RNN.
U. KumaranRadha RamMohan S.Senthil Murugan NagarajanA. PrathikPublished in: Int. J. Speech Technol. (2021)
Keyphrases
- cepstral coefficients
- speech recognition
- speech signal
- linear predictive
- recurrent neural networks
- linear prediction
- linear predictive coding
- feature set
- hidden markov models
- nearest neighbor
- audio signal
- automatic speech recognition
- pattern recognition
- noisy environments
- language model
- facial expressions
- spectral analysis
- speaker identification
- image fusion
- visual speech
- non stationary
- visual features
- feature space
- neural network