Spatial-temporal Graph Based Multi-channel Speaker Verification With Ad-hoc Microphone Arrays.
Yijiang ChenChengdong LiangXiao-Lei ZhangPublished in: CoRR (2023)
Keyphrases
- spatial temporal
- multi channel
- speaker verification
- noisy environments
- action recognition
- audio visual
- spatio temporal
- spatial and temporal
- automatic speech recognition
- multilayer perceptron
- temporal information
- visual information
- video shots
- emotion recognition
- neural network
- multi modal
- language identification
- spatial information
- machine learning
- human actions
- feature selection