Login / Signup
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior.
Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuan Cao
Heiga Zen
Andrew Rosenberg
Bhuvana Ramabhadran
Yonghui Wu
Published in:
CoRR (2020)
Keyphrases
</>
fine grained
text to speech
autoregressive
moving average
speech synthesis
coarse grained
prosodic features
access control
non stationary
programming tool
text to speech synthesis
random fields
word processing
state space model
image processing
sar images
graphical models