Login / Signup
Stream weight optimization of speech and lip image sequence for audio-visual speech recognition.
Satoshi Nakamura
Hidetoshi Ito
Kiyohiro Shikano
Published in:
INTERSPEECH (2000)
Keyphrases
</>
audio visual speech recognition
multi stream
audio visual
image sequences
hidden markov models
spatio temporal
speech recognition
visual information
visual data
video sequences
computer vision
three dimensional
multi modal