Extending RNN-T-based speech recognition systems with emotion and language classification.
Zvi Kons, Hagai Aronowitz, Edmilson da Silva Morais, Matheus Damasceno, Hong-Kwang Kuo, Samuel Thomas, George Saon
Published in: INTERSPEECH (2022)
Keyphrases
- classification accuracy
- nearest neighbor
- pattern classification
- pattern recognition
- speech recognition systems
- support vector
- recurrent neural networks
- feature space
- feature vectors
- feature extraction
- image classification
- model selection
- feature selection
- machine learning
- natural language
- hidden Markov models
- facial expressions
- text classification
- multimodal
- speech recognition