Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification.

Hira Dhamyal Tianyan Zhou Bhiksha Raj Rita Singh

Published in: ASRU (2019)

Keyphrases

speaker verification
neural network
pairwise
multilayer perceptron
noisy environments
speaker recognition
language identification
prosodic features
artificial neural networks
information retrieval
pattern recognition
text mining
emotion recognition
dimensionality reduction
keywords
neural network model
audio visual
semantic information
face verification
image sequences