Login / Signup
Synthesis of Expressive Speaking Styles with Limited Training Data in a Multi-Speaker, Prosody-Controllable Sequence-to-Sequence Architecture.
Slava Shechtman
Raul Fernandez
Alexander Sorin
David Haws
Published in:
Interspeech (2021)
Keyphrases
</>
training data
domain knowledge
data sets
audio visual
machine learning
genetic algorithm
learning algorithm
feature vectors
prior knowledge
active learning
hidden markov models
semi supervised
speech recognition