Login / Signup

Phonetically-Aware Embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention Models for the 2018 NIST Speaker Recognition Evaluation.

Ignacio ViñalsDayana RibasVictoria MingoteJorge LlombartPablo GimenoAntonio MiguelAlfonso Ortega GiménezEduardo Lleida
Published in: INTERSPEECH (2019)
Keyphrases
  • speaker recognition
  • probabilistic model
  • model selection
  • non stationary
  • gaussian mixture model
  • feature extraction
  • bayesian networks
  • video sequences
  • dimensionality reduction
  • speaker identification