End-to-end speaker identification research based on multi-scale SincNet and CGAN.
Guangcun Wei, Yanna Zhang, Hang Min, Yunfei Xu
Published in: Neural Comput. Appl. (2023)
Keyphrases
- end to end
- speaker identification
- multiscale
- speech recognition
- gaussian mixture model
- speech signal
- noisy environments
- feature extraction
- broadcast news
- wavelet transform
- natural images
- scale space
- edge detection
- mixture model
- image segmentation
- image processing
- congestion control
- real time
- bit rate
- wavelet coefficients
- face recognition
- e learning
- neural network