Robust speaker identification based on perceptual log area ratio and Gaussian mixture models.
David ChowWaleed H. AbdullaPublished in: INTERSPEECH (2004)
Keyphrases
- gaussian mixture model
- speaker identification
- speaker recognition
- mixture model
- probabilistic neural network
- language identification
- noisy environments
- feature vectors
- em algorithm
- expectation maximization
- maximum likelihood
- mel frequency cepstral coefficients
- speech recognition
- feature space
- probabilistic model
- low level
- image processing
- machine learning
- data points
- generative model
- unsupervised learning
- high dimensional
- broadcast news
- similarity measure
- computer vision
- neural network