Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder.
Yi Zhao, Shinji Takaki, Hieu-Thi Luong, Junichi Yamagishi, Daisuke Saito, Nobuaki Minematsu
Published in: IEEE Access (2018)