Audio-visual multi-channel speech separation, dereverberation and recognition.
Guinan LiJianwei YuJiajun DengXunying LiuHelen MengPublished in: CoRR (2022)
Keyphrases
- audio visual
- multi channel
- multi modal
- digit recognition
- single channel
- sound source
- visual information
- multi stream
- multimedia
- visual data
- pattern recognition
- object recognition
- audio visual speech recognition
- emotion recognition
- speaker verification
- computer vision
- action recognition
- feature extraction
- speaker identification
- low level
- image processing