Multimodal Association for Speaker Verification.
Suwon Shon, James R. Glass
Published in: INTERSPEECH (2020)
Keyphrases
- speaker verification
- audio visual
- noisy environments
- speaker recognition
- prosodic features
- multimodal
- emotion recognition
- multilayer perceptron
- pattern recognition
- visual information
- information retrieval
- object detection
- using artificial neural networks
- image data
- low level
- extracting features
- language identification
- artificial neural networks
- multimedia