Research on generalization property of time-varying Fbank-weighted MFCC for i-vector based speaker verification.
Jun Wang, Lantian Li, Dong Wang, Thomas Fang Zheng
Published in: ISCSLP (2014)
Keyphrases
- speaker verification
- speaker recognition
- noisy environments
- acoustic features
- prosodic features
- audio visual
- feature vectors
- mel frequency cepstral coefficients
- multilayer perceptron
- speech recognition
- language identification
- emotion recognition
- speaker identification
- using artificial neural networks
- speaker diarization
- face verification
- neural network
- gaussian mixture model
- data fusion
- multi modal
- pattern recognition
- multiscale
- high level