One-Shot Voice Conversion with Speaker-Agnostic StarGAN.
Sefik Emre Eskimez, Dimitrios Dimitriadis, Ken'ichi Kumatani, Robert Gmyr
Published in: Interspeech (2021)
Keyphrases
- synthesized speech
- prosodic features
- speaker verification
- speech sounds
- text to speech
- audio visual
- speaker recognition
- mel frequency cepstral coefficients
- emotion recognition
- speaker identification
- speech recognition
- automatic speech recognition
- structured light
- speech synthesis
- speech signal
- data conversion
- real world
- gaussian mixture model
- speaker diarization
- decision trees
- multimedia
- speaker dependent
- artificial intelligence
- data mining