Audio-Visual End-to-End Multi-Channel Speech Separation, Dereverberation and Recognition.
Guinan LiJiajun DengMengzhe GengZengrui JinTianzi WangShujie HuMingyu CuiHelen MengXunying LiuPublished in: IEEE ACM Trans. Audio Speech Lang. Process. (2023)
Keyphrases
- audio visual
- end to end
- multi channel
- multi modal
- single channel
- visual information
- multi stream
- sound source
- visual data
- text localization and recognition
- multimedia
- congestion control
- multipath
- ad hoc networks
- multi hop
- wireless ad hoc networks
- audio features
- emotion recognition
- action recognition
- pattern recognition
- feature extraction
- activity recognition
- data management
- feature vectors
- mac protocol
- audio visual speech recognition