A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition.
Louis H. TerryAggelos K. KatsaggelosPublished in: ICPR (2008)
Keyphrases
- audio visual
- automatic speech recognition
- hidden markov models
- speech recognition
- multi modal
- speech signal
- visual speech
- visual information
- speech corpus
- visual data
- noisy environments
- broadcast news
- speaker verification
- emotion recognition
- multimedia
- acoustic features
- audio features
- human motion
- multiscale
- bayesian networks
- computer vision