Login / Signup
MM-TTS: Multi-Modal Prompt Based Style Transfer for Expressive Text-to-Speech Synthesis.
Wenhao Guan
Yishuang Li
Tao Li
Hukai Huang
Feng Wang
Jiayan Lin
Lingyan Huang
Lin Li
Qingyang Hong
Published in:
AAAI (2024)
Keyphrases
</>
multi modal
text to speech
text to speech synthesis
multi modality
image processing
audio visual
cross modal
humanoid robot
semantic concepts
markov random field
video search
word processing
fusing multiple
uni modal