Sign in

Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model.

Fan ZhangNaye JiFuxing GaoSiyuan ZhaoZhaohan WangShunman Li
Published in: CoRR (2023)
Keyphrases
  • probabilistic model
  • neural network
  • computer vision
  • high dimensional
  • facial expressions
  • statistical model