DeepVOX: Discovering Features from Raw Audio for Speaker Recognition in Degraded Audio Signals.
Anurag ChowdhuryArun RossPublished in: CoRR (2020)
Keyphrases
- audio signals
- speaker identification
- speaker recognition
- audio signal
- mel frequency cepstral coefficients
- feature extraction
- gaussian mixture model
- speech recognition
- probabilistic neural network
- feature vectors
- speech signal
- extracted features
- audio features
- multimedia
- feature space
- noisy environments
- music information retrieval
- feature set
- low level
- information retrieval
- self organizing maps
- wavelet transform
- acoustic features