Schlieren imaging and video classification of alphabet pronunciations: exploiting phonetic flows for speech recognition and speech therapy.
Mohamed TalaatKian BarariXiuhua April SiJinxiang XiPublished in: Vis. Comput. Ind. Biomed. Art (2024)
Keyphrases
- speech recognition
- video classification
- speech recognizer
- automatic speech recognition
- speech signal
- speech recognition systems
- speech synthesis
- hidden markov models
- speech processing
- language model
- speaker independent
- pattern recognition
- speech recognition technology
- video shots
- image processing
- speaker identification
- video content
- noisy environments
- speech recognizers
- video clips
- speech retrieval
- acoustic models
- speech recognition errors
- isolated word
- speaker adaptation
- computer vision
- spoken term detection
- generative model
- cepstral coefficients