Bayesian Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification.
Yingke ZhuBrian MakPublished in: IEEE ACM Trans. Audio Speech Lang. Process. (2023)
Keyphrases
- speaker verification
- speaker recognition
- noisy environments
- language identification
- prosodic features
- acoustic features
- information retrieval
- audio visual
- emotion recognition
- text mining
- speaker diarization
- multilayer perceptron
- bayesian networks
- noise reduction
- maximum likelihood
- speech recognition
- semantic information
- using artificial neural networks
- non stationary
- multi modal
- edge detection
- low level
- keywords
- image processing
- learning algorithm