Fuzzy-Neural-Network Based Audio-Visual Fusion for Speech Recognition.
Gin-Der Wu, Hao-Shu Tsai
Published in: ICAIIC (2019)
Keyphrases
- audio visual
- speech recognition
- audio visual speech recognition
- person authentication
- multimodal fusion
- multi modal
- multi stream
- visual information
- language model
- neural network
- hidden Markov models
- multimedia
- speech recognizer
- automatic speech recognition
- visual data
- emotion recognition
- digit recognition
- speech signal
- noisy environments
- speaker identification
- speech recognition systems
- speech synthesis
- pattern recognition
- speaker verification
- audio features
- text mining
- information extraction
- Bayesian networks
- high level