Audio-visual speech recognition integrating 3D lip information obtained from the Kinect.

Jianrong Wang, Ju Zhang, Kiyoshi Honda, Jianguo Wei, Jianwu Dang
Published in: Multim. Syst. (2016)
Keyphrases
  • audio visual speech recognition
  • multiresolution
  • multi modal
  • multimedia
  • keywords
  • low level
  • motion estimation
  • contextual information
  • semantic information