End-to-end speaker identification research based on multi-scale SincNet and CGAN.

Guangcun Wei Yanna Zhang Hang Min Yunfei Xu

Published in: Neural Comput. Appl. (2023)

Keyphrases

end to end
speaker identification
multiscale
speech recognition
gaussian mixture model
speech signal
noisy environments
feature extraction
broadcast news
wavelet transform
natural images
scale space
edge detection
mixture model
image segmentation
image processing
congestion control
real time
bit rate
wavelet coefficients
face recognition
e learning
neural network