Audio-visual Multi-channel Recognition of Overlapped Speech.
Jianwei YuBo WuRongzhi GuShi-Xiong ZhangLianwu ChenYong XuMeng YuDan SuDong YuXunying LiuHelen MengPublished in: CoRR (2020)
Keyphrases
- audio visual
- multi channel
- multi modal
- digit recognition
- visual information
- multi stream
- single channel
- emotion recognition
- visual data
- pattern recognition
- multimedia
- feature extraction
- object recognition
- speaker verification
- audio visual speech recognition
- action recognition
- audio features
- natural language processing
- sound source
- feature vectors
- spatio temporal