CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training.
Zhenhui YeRongjie HuangYi RenZiyue JiangJinglin LiuJinzheng HeXiang YinZhou ZhaoPublished in: CoRR (2023)
Keyphrases
- text to speech
- learning algorithm
- supervised learning
- learning process
- information retrieval
- reinforcement learning
- text mining
- online learning
- learning systems
- learning speed
- word meanings
- neural network
- pattern languages
- context dependent
- training set
- training process
- audio visual
- feedforward neural networks
- language acquisition
- computer software
- mobile learning
- multimedia