Login / Signup
Attention-Based Speaker Embeddings for One-Shot Voice Conversion.
Tatsuma Ishihara
Daisuke Saito
Published in:
INTERSPEECH (2020)
Keyphrases
</>
prosodic features
synthesized speech
speaker verification
real time
speech recognition
audio visual
speaker recognition
visual information
visual attention
emotion recognition
information retrieval
dimensionality reduction
focus of attention