Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model.
Fan Zhang
Naye Ji
Fuxing Gao
Siyuan Zhao
Zhaohan Wang
Shunman Li
Published in:
CoRR (2023)
Keyphrases
probabilistic model
neural network
computer vision
high dimensional
facial expressions
statistical model