Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters.
Kenichi Fujita
Hiroshi Sato
Takanori Ashihara
Hiroki Kanagawa
Marc Delcroix
Takafumi Moriya
Yusuke Ijima
Published in:
CoRR (2024)
Keyphrases
probabilistic model
statistical model
high level
probability distribution
least squares
input data
computational model
input output
image processing
management system
markov random field
operating system
text to speech synthesis