Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts.
Shun Lei
Yixuan Zhou
Liyang Chen
Dan Luo
Zhiyong Wu
Xixin Wu
Shiyin Kang
Tao Jiang
Yahui Zhou
Yuxing Han
Helen Meng
Published in:
CoRR (2023)
Keyphrases
text to speech synthesis
multiscale
text to speech
coarse to fine
data driven
multiple scales
natural images
scale space
database
database systems
image segmentation
edge detection
data model
image representation
multiresolution
local binary pattern
face recognition