Gabor-based Audiovisual Fusion for Mandarin Chinese Speech Recognition.
Yan Xu, Hongce Wang, Zhongping Dong, Yuexuan Li, Andrew Abel
Published in: EUSIPCO (2022)
Keyphrases
- speech recognition
- hidden Markov models
- language model
- speech processing
- automatic speech recognition
- speech signal
- speech recognition technology
- speech recognizer
- speech synthesis
- speech understanding
- pattern recognition
- noisy environments
- neural network
- visual information
- speech recognizers
- video retrieval
- multimedia content
- speech recognition systems
- cepstral coefficients
- emotion recognition
- speaker identification
- image fusion
- speech recognition errors
- isolated word
- speaker dependent
- audio visual
- computer vision