Login / Signup

Synthesis of Expressive Speaking Styles with Limited Training Data in a Multi-Speaker, Prosody-Controllable Sequence-to-Sequence Architecture.

Slava ShechtmanRaul FernandezAlexander SorinDavid Haws
Published in: Interspeech (2021)
Keyphrases
  • training data
  • domain knowledge
  • data sets
  • audio visual
  • machine learning
  • genetic algorithm
  • learning algorithm
  • feature vectors
  • prior knowledge
  • active learning
  • hidden markov models
  • semi supervised
  • speech recognition