• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters.

Kenichi FujitaHiroshi SatoTakanori AshiharaHiroki KanagawaMarc DelcroixTakafumi MoriyaYusuke Ijima
Published in: CoRR (2024)
Keyphrases