Controllable speech synthesis by learning discrete phoneme-level prosodic representations.

Published in: CoRR (2022)

Keyphrases