Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition.
Shansong LiuShoukang HuYi WangJianwei YuRongfeng SuXunying LiuHelen MengPublished in: INTERSPEECH (2019)
Keyphrases
- visual features
- speech recognition
- neural network
- pattern recognition
- image classification
- visual information
- visual content
- image retrieval
- hidden markov models
- language model
- automatic speech recognition
- speech synthesis
- semantic concepts
- image collections
- low level
- image search
- speech recognition systems
- speech recognizer
- low level features
- keywords
- noisy environments
- image annotation
- speech signal
- semantic features
- bayesian networks
- speaker identification
- key frames
- speaker independent
- visual data