An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification.
Yafeng ChenSiqi ZhengHui WangLuyao ChengQian ChenJiajun QiPublished in: INTERSPEECH (2023)
Keyphrases
- speaker verification
- feature fusion
- noisy environments
- feature extraction
- multiple features
- emotion recognition
- audio visual
- neural network
- high resolution
- multi modal
- noise reduction
- support vector
- face verification
- training data
- image processing
- fusion algorithm
- canonical correlation analysis
- single feature
- computer vision
- data mining