Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification.
Hira DhamyalTianyan ZhouBhiksha RajRita SinghPublished in: ASRU (2019)
Keyphrases
- speaker verification
- neural network
- pairwise
- multilayer perceptron
- noisy environments
- speaker recognition
- language identification
- prosodic features
- artificial neural networks
- information retrieval
- pattern recognition
- text mining
- emotion recognition
- dimensionality reduction
- keywords
- neural network model
- audio visual
- semantic information
- face verification
- image sequences