Login / Signup
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts.
Shun Lei
Yixuan Zhou
Liyang Chen
Dan Luo
Zhiyong Wu
Xixin Wu
Shiyin Kang
Tao Jiang
Yahui Zhou
Yuxing Han
Helen Meng
Published in:
ICASSP (2024)
Keyphrases
</>
text to speech synthesis
multiscale
text to speech
wavelet transform
computer vision
image segmentation
natural images
multiple scales
learning algorithm
scale space
programming language
pattern languages
optic flow
mental models
object oriented
image processing
artificial intelligence