Vocoder-free text-to-speech synthesis incorporating generative adversarial networks using low-/multi-frequency STFT amplitude spectra.

Published in: Comput. Speech Lang. (2019)

Keyphrases