PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions.
Reo ShimizuRyuichi YamamotoMasaya KawamuraYuma ShirahataHironori DoiTatsuya KomatsuKentaro TachibanaPublished in: CoRR (2023)
Keyphrases
- text to speech
- natural language descriptions
- prosodic features
- speech synthesis
- natural language
- programming tool
- speaker verification
- text to speech synthesis
- identity management
- english text
- word processing
- speech recognition
- writing skills
- audio visual
- speaker diarization
- human computer interaction
- software engineering
- pattern recognition
- neural network