Speaker verification using attentive multi-scale convolutional recurrent network.
Yanxiong LiZhongjie JiangWenchang CaoQisheng HuangPublished in: CoRR (2023)
Keyphrases
- speaker verification
- recurrent networks
- multiscale
- recurrent neural networks
- noisy environments
- biologically inspired
- speaker recognition
- feed forward
- prosodic features
- neural network
- audio visual
- multilayer perceptron
- natural images
- visual attention
- image representation
- using artificial neural networks
- edge detection
- noise reduction
- emotion recognition
- input image
- image segmentation
- multi modal
- speech recognition
- face verification
- low level
- expert systems
- genetic algorithm