Login / Signup
Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech.
Chung-Ming Chien
Jheng-Hao Lin
Chien-yu Huang
Po-chun Hsu
Hung-yi Lee
Published in:
CoRR (2021)
Keyphrases
</>
text to speech
speech recognition
prosodic features
speech synthesis
speaker verification
speaker recognition
maximum likelihood
using artificial neural networks