Login / Signup
JDI-T: Jointly Trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment.
Dan Lim
Won Jang
Gyeonghwan O
Heayoung Park
Bongwan Kim
Jaesam Yoon
Published in:
INTERSPEECH (2020)
Keyphrases
</>
text to speech
speech synthesis
prosodic features
fuzzy logic
text to speech synthesis
multilayer perceptron
programming tool
word processing
english text
fault diagnosis
dynamic time warping
training process
multi layer perceptron
image alignment
power system
writing skills
hidden semi markov models
neural network