A late fusion deep neural network for robust speaker identification using raw waveforms and gammatone cepstral coefficients.
Daniele Salvati
Carlo Drioli
Gian Luca Foresti
Published in:
Expert Syst. Appl. (2023)
Keyphrases
speaker identification
neural network
noisy environments
speech signal
speech recognition
mel frequency cepstral coefficients
gaussian mixture model
speaker recognition
feature extraction
machine learning
pattern recognition
image features
image classification
multi modal
speaker diarization