Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining.
Takaaki SaekiSoumi MaitiXinjian LiShinji WatanabeShinnosuke TakamichiHiroshi SaruwatariPublished in: CoRR (2023)
Keyphrases
- text to speech
- word processing
- text to speech synthesis
- learning process
- writing skills
- text segmentation
- learning algorithm
- text retrieval
- reinforcement learning
- speech synthesis
- supervised learning
- weakly supervised
- text generation
- english text
- document analysis
- machine learning
- unsupervised learning
- multi lingual
- online learning
- natural language generation
- sentence level
- speech recognition
- active learning
- digital libraries
- programming tool
- keywords
- information retrieval