CMCU-CSS: Enhancing Naturalness via Commonsense-based Multi-modal Context Understanding in Conversational Speech Synthesis.
Yayue Deng
Jinlong Xue
Fengping Wang
Yingming Gao
Ya Li
Published in:
ACM Multimedia (2023)
Keyphrases
multi modal
speech synthesis
speech recognition
contextual information
multi modality
high dimensional
context aware
cross modal
text to speech
semantic concepts
machine learning
feature selection
image processing
image segmentation
multiple modalities
fusing multiple