Towards zero-shot text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders.

Published in: CoRR (2022)

Keyphrases