On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition.
Lokesh Bansal, S. Pavankumar Dubagunta, Malolan Chetlur, Pushpak Jagtap, Aravind Ganapathiraju
Published in: INTERSPEECH (2023)
Keyphrases
- automatic speech recognition
- noisy environments
- speech recognition
- speech signal
- spectral subtraction
- word error rate
- hidden Markov models
- broadcast news
- speech corpus
- conversational speech
- speech enhancement
- word recognition
- spontaneous speech
- speaker verification
- speech retrieval
- speech sounds
- speaker identification
- speech synthesis
- spoken words
- acoustic features
- signal-to-noise ratio
- speech recognizers
- recognition errors
- speech segments
- formant frequencies
- phoneme recognition
- neural network
- noise reduction
- non-stationary
- language model
- acoustic models
- speech recognition systems
- linear prediction
- pattern recognition
- multiscale