Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations.
Álvaro Martín-CortinasDaniel Sáez-TriguerosIván Vallés-PérezBiel Tura VecinoPiotr BilinskiMateusz LajszczakGrzegorz BeringerRoberto Barra-ChicoteJaime Lorenzo-TruebaPublished in: CoRR (2024)