Login / Signup

Mellotron: Multispeaker Expressive Voice Synthesis by Conditioning on Rhythm, Pitch and Global Style Tokens.

Rafael ValleJason LiRyan PrengerBryan Catanzaro
Published in: ICASSP (2020)
Keyphrases
  • databases
  • fundamental frequency
  • database
  • real time
  • neural network
  • computer vision
  • clustering algorithm
  • image segmentation
  • multiresolution
  • texture synthesis
  • global information
  • program synthesis