Login / Signup
MSStyleTTS: Multi-Scale Style Modeling With Hierarchical Context Information for Expressive Speech Synthesis.
Shun Lei
Yixuan Zhou
Liyang Chen
Zhiyong Wu
Xixin Wu
Shiyin Kang
Helen Meng
Published in:
IEEE ACM Trans. Audio Speech Lang. Process. (2023)
Keyphrases
</>
speech synthesis
multiscale
speech recognition
coarse to fine
image processing
edge detection
data sets
databases
non stationary
modeling method
real world
computer vision
wavelet transform
image coding
modeling framework
text to speech