AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person.

Xinsheng Wang Qicong Xie Jihua Zhu Lei Xie Odette Scharenborg

Published in: CoRR (2021)

Keyphrases

speech recognition
audio visual
speech synthesis
facial gestures
automatic speech recognition
speaker identification
automatic speech recognition systems
visual focus of attention
magnetic recording
recognition engine
hand movements
text to speech
spoken dialogue systems
neural network
spoken language
head pose estimation
dialogue system
speech signal
pose estimation
video sequences