Joint and Adversarial Training with ASR for Expressive Speech Synthesis.

Kaili Zhang Cheng Gong Wenhuan Lu Longbiao Wang Jianguo Wei Dawei Liu

Published in: ICASSP (2022)

Keyphrases

speech synthesis
speech recognition
speech corpus
automatic speech recognition
vocal tract
text to speech
prosodic features
training process
online learning
speech signal
test set
language model
hidden markov models
training set
training algorithm
database
real time
speech retrieval
training phase
training samples
data mining
noisy environments
image acquisition
small number
multi agent
e learning