The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction.
Shilong WuChenxi WangHang ChenYusheng DaiChenyue ZhangRuoyu WangHongbo LanJun DuChin-Hui LeeJingdong ChenSabato Marco SiniscalchiOdette ScharenborgZhong-Qiu WangJia PanJianqing GaoPublished in: ICASSP (2024)