MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis.
Shun Lei
Yixuan Zhou
Liyang Chen
Zhiyong Wu
Xixin Wu
Shiyin Kang
Helen Meng
Published in:
CoRR (2023)
Keyphrases
speech synthesis
multiscale
speech recognition
coarse to fine
vocal tract
text to speech
wavelet transform
data sets
scale space
natural images
hierarchical structures
neural network
information systems
face recognition
image representation
local binary pattern