AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person.
Xinsheng Wang, Qicong Xie, Jihua Zhu, Lei Xie, Odette Scharenborg
Published in: CoRR (2021)
Keyphrases
- speech recognition
- audio visual
- speech synthesis
- facial gestures
- automatic speech recognition
- speaker identification
- automatic speech recognition systems
- visual focus of attention
- magnetic recording
- recognition engine
- hand movements
- text to speech
- spoken dialogue systems
- neural network
- spoken language
- head pose estimation
- dialogue system
- speech signal
- pose estimation
- video sequences