Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder.
Yi Zhao, Shinji Takaki, Hieu-Thi Luong, Junichi Yamagishi, Daisuke Saito, Nobuaki Minematsu
Published in: CoRR (2018)