Login / Signup

Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters.

Kenichi FujitaHiroshi SatoTakanori AshiharaHiroki KanagawaMarc DelcroixTakafumi MoriyaYusuke Ijima
Published in: CoRR (2024)
Keyphrases