An Experimental Analysis on Integrating Multi-Stream Spectro-Temporal, Cepstral and Pitch Information for Mandarin Speech Recognition.
Yow-Bang Wang, Shang-wen Li, Lin-Shan Lee
Published in: IEEE Trans. Speech Audio Process. (2013)
Keyphrases
- speaker identification
- speech recognition
- speech processing
- speech signal
- hidden markov models
- speaker independent
- noisy environments
- broadcast news
- language model
- speech recognizer
- automatic speech recognition
- speaker dependent
- speech synthesis
- pattern recognition
- multi stream
- contextual information
- machine learning
- audio visual speech recognition
- cross correlation
- multi modal
- high level
- image processing