VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment.
Bing Han
Long Zhou
Shujie Liu
Sanyuan Chen
Lingwei Meng
Yanming Qian
Yanqing Liu
Sheng Zhao
Jinyu Li
Furu Wei
Published in:
CoRR (2024)
Keyphrases
text to speech synthesis
computationally efficient
robust estimation
neural network
decision trees
evolutionary algorithm
computationally expensive
data sets
machine learning
computer vision
image processing
lightweight
cost effective
highly efficient