Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge.
Wenhao GuanTao LiYishuang LiHukai HuangQingyang HongLin LiPublished in: CoRR (2023)
Keyphrases
- text to speech
- speech synthesis
- prosodic features
- text to speech synthesis
- word processing
- transfer learning
- programming tool
- knowledge transfer
- english text
- diffusion process
- reliability assessment
- information retrieval
- cross domain
- anisotropic diffusion
- classification rules
- multi modal
- writing skills
- general purpose
- probabilistic model