ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis.
Haobin TangXulong ZhangNing ChengJing XiaoJianzong WangPublished in: CoRR (2024)
Keyphrases
- speech synthesis
- cross domain
- text to speech
- emotional state
- emotion recognition
- multiscale
- speech recognition
- prosodic features
- transfer learning
- vocal tract
- domain adaptation
- multiple domains
- facial expressions
- sentiment classification
- affective states
- neural network
- knowledge transfer
- sentiment analysis
- language model
- e government
- image processing
- computer vision
- data mining