DiCLET-TTS: Diffusion Model Based Cross-Lingual Emotion Transfer for Text-to-Speech - A Study Between English and Mandarin.
Tao Li, Chenxu Hu, Jian Cong, Xinfa Zhu, Jingbei Li, Qiao Tian, Yuping Wang, Lei Xie
Published in: IEEE ACM Trans. Audio Speech Lang. Process. (2023)