Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model.

Published in: CoRR (2023)

Keyphrases