Sign in

SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition.

Yihan WuSoumi MaitiYifan PengWangyou ZhangChenda LiYuyue WangXihua WangShinji WatanabeRuihua Song
Published in: CoRR (2024)
Keyphrases
  • text to speech
  • multiple tasks
  • database
  • computer vision
  • pattern recognition
  • real time
  • neural network
  • feature vectors
  • mobile robot
  • multi modal