Speech Synthesis with Self-Supervisedly Learnt Prosodic Representations.

Zhaoci Liu Zhen-Hua Ling Ya-Jun Hu Jia Pan Jin-Wei Wang Yun-Di Wu

Published in: INTERSPEECH (2023)

Keyphrases

speech synthesis
prosodic features
speech recognition
text to speech
vocal tract
text to speech synthesis
speech corpus
higher level
automatic speech recognition
hidden markov models
language model
symbolic representation
multiple representations
databases
noisy environments
real time
external representations
data sets
real world
computer vision
word processing
database systems
information systems
expert systems
bayesian networks