Sign in

BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization.

Henry B. MossVatsal AggarwalNishant PrateekJavier GonzálezRoberto Barra-Chicote
Published in: ICASSP (2020)
Keyphrases
  • speaker adaptation
  • maximum likelihood
  • video shots
  • text to speech
  • machine learning
  • video data
  • information retrieval
  • computer vision
  • bayesian networks
  • video sequences
  • low level
  • image classification
  • visual features