NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models.
Zeqian JuYuancheng WangKai ShenXu TanDetai XinDongchao YangYanqing LiuYichong LengKaitao SongSiliang TangZhizheng WuTao QinXiang-Yang LiWei YeShikun ZhangJiang BianLei HeJinyu LiSheng ZhaoPublished in: CoRR (2024)
Keyphrases
- speech synthesis
- diffusion models
- speech recognition
- diffusion model
- information diffusion
- text to speech
- social networks
- prosodic features
- vocal tract
- video coding
- viral marketing
- influence maximization
- pattern recognition
- motion estimation
- speech signal
- anisotropic diffusion
- social network analysis
- hidden markov models