Login / Signup
Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech.
Pengfei Wu
Junjie Pan
Chenchang Xu
Junhui Zhang
Lin Wu
Xiang Yin
Zejun Ma
Published in:
CoRR (2021)
Keyphrases
</>
prosodic features
text to speech
text to speech synthesis
speaker verification
speech synthesis
supervised training
speech recognition
audio visual
automatic speech recognition
emotion recognition
training algorithm
active learning
facial expressions
word processing