Predicting Personalized Head Movement From Short Video and Speech Signal.
Ran Yi, Zipeng Ye, Zhiyao Sun, Juyong Zhang, Guo-Xin Zhang, Pengfei Wan, Hujun Bao, Yong-Jin Liu
Published in: IEEE Trans. Multim. (2023)
Keyphrases
- speech signal
- speech recognition
- head movements
- automatic speech recognition
- hidden Markov models
- noisy environments
- video sequences
- video data
- automatic speech recognition systems
- multimedia
- video frames
- speaker identification
- video retrieval
- non-stationary
- speech quality
- eye tracker
- video streams
- video surveillance
- video content
- eye tracking
- eye gaze
- fundamental frequency
- eye movements
- multi-view
- language model
- key frames
- visual speech
- facial expressions