A Comparative Study of Self-Supervised Speech Representation Based Voice Conversion.
Wen-Chin Huang
Shu-Wen Yang
Tomoki Hayashi
Tomoki Toda
Published in:
IEEE J. Sel. Top. Signal Process. (2022)
Keyphrases
text to speech
emotion recognition
feature representation
speech synthesis
representation scheme
voice activity detection
speech recognition
non stationary
pattern recognition
spatial relations
image representation
spoken language
neural network
prosodic features
speech recognition errors
audio stream
database