ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec.
Shengpeng Ji, Jialong Zuo, Minghui Fang, Siqi Zheng, Qian Chen, Wen Wang, Ziyue Jiang, Hai Huang, Xize Cheng, Rongjie Huang, Zhou Zhao
Published in: CoRR (2024)