Login / Signup

Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model.

Jinlong XueYayue DengYichen HanYingming GaoYa Li
Published in: CoRR (2024)
Keyphrases