Speaker identification and localization using shuffled MFCC features and deep learning.
Mahdi BarhoushAhmed HallawaAnke SchmeinkPublished in: Int. J. Speech Technol. (2023)
Keyphrases
- speaker identification
- deep learning
- mel frequency cepstral coefficients
- unsupervised feature learning
- feature extraction
- gaussian mixture model
- feature vectors
- speech recognition
- audio features
- feature set
- speaker recognition
- pattern recognition
- feature space
- machine learning
- co occurrence
- noisy environments
- unsupervised learning
- broadcast news
- object detection
- low level
- natural language
- feature selection