One-Shot Voice Conversion with Speaker-Agnostic StarGAN.

Sefik Emre Eskimez, Dimitrios Dimitriadis, Ken'ichi Kumatani, Robert Gmyr

Published in: Interspeech (2021)

Keyphrases

synthesized speech
prosodic features
speaker verification
speech sounds
text-to-speech
audio-visual
speaker recognition
mel-frequency cepstral coefficients
emotion recognition
speaker identification
speech recognition
automatic speech recognition
structured light
speech synthesis
speech signal
data conversion
real world
Gaussian mixture model
speaker diarization
decision trees
multimedia
speaker dependent
artificial intelligence
data mining