Login / Signup

CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens.

Zhihao DuQian ChenShiliang ZhangKai HuHeng LuYexin YangHangrui HuSiqi ZhengYue GuZiyang MaZhifu GaoZhijie Yan
Published in: CoRR (2024)
Keyphrases