M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis.

Published in: CoRR (2023)
