Login / Signup
Chenda Li
ORCID
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 42
Top Topics
Speech Enhancement
Single Channel
Future Plans
Noisy Environments
Top Venues
CoRR
ICASSP
INTERSPEECH
SLT
</>
Publications
</>
Yihan Wu
,
Soumi Maiti
,
Yifan Peng
,
Wangyou Zhang
,
Chenda Li
,
Yuyue Wang
,
Xihua Wang
,
Shinji Watanabe
,
Ruihua Song
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition.
CoRR
(2024)
Jiahong Li
,
Chenda Li
,
Yifei Wu
,
Yanmin Qian
Unified Cross-Modal Attention: Robust Audio-Visual Speech Recognition and Beyond.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Chenda Li
,
Samuele Cornell
,
Shinji Watanabe
,
Yanmin Qian
Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement.
CoRR
(2024)
Wangyou Zhang
,
Kohei Saijo
,
Jee-weon Jung
,
Chenda Li
,
Shinji Watanabe
,
Yanmin Qian
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement.
CoRR
(2024)
Wangyou Zhang
,
Robin Scheibler
,
Kohei Saijo
,
Samuele Cornell
,
Chenda Li
,
Zhaoheng Ni
,
Anurag Kumar
,
Jan Pirklbauer
,
Marvin Sach
,
Shinji Watanabe
,
Tim Fingscheidt
,
Yanmin Qian
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement.
CoRR
(2024)
Chenda Li
,
Yao Qian
,
Zhuo Chen
,
Naoyuki Kanda
,
Dongmei Wang
,
Takuya Yoshioka
,
Yanmin Qian
,
Michael Zeng
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers.
CoRR
(2023)
Yen-Ju Lu
,
Xuankai Chang
,
Chenda Li
,
Wangyou Zhang
,
Samuele Cornell
,
Zhaoheng Ni
,
Yoshiki Masuyama
,
Brian Yan
,
Robin Scheibler
,
Zhong-Qiu Wang
,
Yu Tsao
,
Yanmin Qian
,
Shinji Watanabe
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing.
J. Open Source Softw.
8 (91) (2023)
Chenda Li
,
Yao Qian
,
Zhuo Chen
,
Naoyuki Kanda
,
Dongmei Wang
,
Takuya Yoshioka
,
Yanmin Qian
,
Michael Zeng
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers.
INTERSPEECH
(2023)
Chenda Li
,
Yao Qian
,
Zhuo Chen
,
Dongmei Wang
,
Takuya Yoshioka
,
Shujie Liu
,
Yanmin Qian
,
Michael Zeng
Target Sound Extraction with Variable Cross-Modality Clues.
ICASSP
(2023)
Linfeng Yu
,
Wangyou Zhang
,
Chenda Li
,
Yanmin Qian
Overlap Aware Continuous Speech Separation without Permutation Invariant Training.
INTERSPEECH
(2023)
Jiahong Li
,
Chenda Li
,
Yifei Wu
,
Yanmin Qian
Robust Audio-Visual ASR with Unified Cross-Modal Attention.
ICASSP
(2023)
Chenda Li
,
Yifei Wu
,
Yanmin Qian
Predictive Skim: Contrastive Predictive Coding for Low-Latency Online Speech Separation.
ICASSP
(2023)
Yifei Wu
,
Chenda Li
,
Yanmin Qian
Light-Weight Visualvoice: Neural Network Quantization On Audio Visual Speech Separation.
ICASSP Workshops
(2023)
Chenda Li
,
Yao Qian
,
Zhuo Chen
,
Dongmei Wang
,
Takuya Yoshioka
,
Shujie Liu
,
Yanmin Qian
,
Michael Zeng
Target Sound Extraction with Variable Cross-modality Clues.
CoRR
(2023)
Yen-Ju Lu
,
Samuele Cornell
,
Xuankai Chang
,
Wangyou Zhang
,
Chenda Li
,
Zhaoheng Ni
,
Zhong-Qiu Wang
,
Shinji Watanabe
Towards Low-Distortion Multi-Channel Speech Enhancement: The ESPNET-Se Submission to the L3DAS22 Challenge.
ICASSP
(2022)
Chenda Li
,
Zhuo Chen
,
Yanmin Qian
Dual-Path Modeling With Memory Embedding Model for Continuous Speech Separation.
IEEE ACM Trans. Audio Speech Lang. Process.
30 (2022)
Yen-Ju Lu
,
Xuankai Chang
,
Chenda Li
,
Wangyou Zhang
,
Samuele Cornell
,
Zhaoheng Ni
,
Yoshiki Masuyama
,
Brian Yan
,
Robin Scheibler
,
Zhong-Qiu Wang
,
Yu Tsao
,
Yanmin Qian
,
Shinji Watanabe
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding.
CoRR
(2022)
Yen-Ju Lu
,
Xuankai Chang
,
Chenda Li
,
Wangyou Zhang
,
Samuele Cornell
,
Zhaoheng Ni
,
Yoshiki Masuyama
,
Brian Yan
,
Robin Scheibler
,
Zhong-Qiu Wang
,
Yu Tsao
,
Yanmin Qian
,
Shinji Watanabe
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding.
INTERSPEECH
(2022)
Yen-Ju Lu
,
Samuele Cornell
,
Xuankai Chang
,
Wangyou Zhang
,
Chenda Li
,
Zhaoheng Ni
,
Zhong-Qiu Wang
,
Shinji Watanabe
Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge.
CoRR
(2022)
Bowen Qu
,
Chenda Li
,
Jinfeng Bai
,
Yanmin Qian
Improving Speech Separation with Knowledge Distilled from Self-supervised Pre-trained Models.
ISCSLP
(2022)
Wei Wang
,
Xun Gong
,
Yifei Wu
,
Zhikai Zhou
,
Chenda Li
,
Wangyou Zhang
,
Bing Han
,
Yanmin Qian
The Sjtu System For Multimodal Information Based Speech Processing Challenge 2021.
ICASSP
(2022)
Chenda Li
,
Lei Yang
,
Weiqin Wang
,
Yanmin Qian
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation.
CoRR
(2022)
Chenda Li
,
Lei Yang
,
Weiqin Wang
,
Yanmin Qian
Skim: Skipping Memory Lstm for Low-Latency Real-Time Continuous Speech Separation.
ICASSP
(2022)
Yifei Wu
,
Chenda Li
,
Jinfeng Bai
,
Zhongqin Wu
,
Yanmin Qian
Time-Domain Audio-Visual Speech Separation on Low Quality Videos.
ICASSP
(2022)
Wangyou Zhang
,
Jing Shi
,
Chenda Li
,
Shinji Watanabe
,
Yanmin Qian
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions.
WASPAA
(2021)
Pengcheng Guo
,
Florian Boyer
,
Xuankai Chang
,
Tomoki Hayashi
,
Yosuke Higuchi
,
Hirofumi Inaguma
,
Naoyuki Kamo
,
Chenda Li
,
Daniel Garcia-Romero
,
Jiatong Shi
,
Jing Shi
,
Shinji Watanabe
,
Kun Wei
,
Wangyou Zhang
,
Yuekai Zhang
Recent Developments on Espnet Toolkit Boosted By Conformer.
ICASSP
(2021)
Wangyou Zhang
,
Jing Shi
,
Chenda Li
,
Shinji Watanabe
,
Yanmin Qian
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions.
CoRR
(2021)
Yi Luo
,
Zhuo Chen
,
Cong Han
,
Chenda Li
,
Tianyan Zhou
,
Nima Mesgarani
Rethinking The Separation Layers In Speech Separation Networks.
ICASSP
(2021)
Yifei Wu
,
Chenda Li
,
Song Yang
,
Zhongqin Wu
,
Yanmin Qian
Audio-Visual Multi-Talker Speech Recognition in a Cocktail Party.
Interspeech
(2021)
Chenda Li
,
Zhuo Chen
,
Yi Luo
,
Cong Han
,
Tianyan Zhou
,
Keisuke Kinoshita
,
Marc Delcroix
,
Shinji Watanabe
,
Yanmin Qian
Dual-Path Modeling for Long Recording Speech Separation in Meetings.
ICASSP
(2021)
Chenda Li
,
Zhuo Chen
,
Yi Luo
,
Cong Han
,
Tianyan Zhou
,
Keisuke Kinoshita
,
Marc Delcroix
,
Shinji Watanabe
,
Yanmin Qian
Dual-Path Modeling for Long Recording Speech Separation in Meetings.
CoRR
(2021)
Chenda Li
,
Jing Shi
,
Wangyou Zhang
,
Aswin Shanmugam Subramanian
,
Xuankai Chang
,
Naoyuki Kamo
,
Moto Hira
,
Tomoki Hayashi
,
Christoph Böddeker
,
Zhuo Chen
,
Shinji Watanabe
ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.
SLT
(2021)
Chenda Li
,
Yi Luo
,
Cong Han
,
Jinyu Li
,
Takuya Yoshioka
,
Tianyan Zhou
,
Marc Delcroix
,
Keisuke Kinoshita
,
Christoph Böddeker
,
Yanmin Qian
,
Shinji Watanabe
,
Zhuo Chen
Dual-Path RNN for Long Recording Speech Separation.
SLT
(2021)
Cong Han
,
Yi Luo
,
Chenda Li
,
Tianyan Zhou
,
Keisuke Kinoshita
,
Shinji Watanabe
,
Marc Delcroix
,
Hakan Erdogan
,
John R. Hershey
,
Nima Mesgarani
,
Zhuo Chen
Continuous Speech Separation Using Speaker Inventory for Long Recording.
Interspeech
(2021)
Chenda Li
,
Yanmin Qian
Listen, Watch and Understand at the Cocktail Party: Audio-Visual-Contextual Speech Separation.
INTERSPEECH
(2020)
Chenda Li
,
Yanmin Qian
Deep Audio-Visual Speech Separation with Attention Mechanism.
ICASSP
(2020)
Pengcheng Guo
,
Florian Boyer
,
Xuankai Chang
,
Tomoki Hayashi
,
Yosuke Higuchi
,
Hirofumi Inaguma
,
Naoyuki Kamo
,
Chenda Li
,
Daniel Garcia-Romero
,
Jiatong Shi
,
Jing Shi
,
Shinji Watanabe
,
Kun Wei
,
Wangyou Zhang
,
Yuekai Zhang
Recent Developments on ESPnet Toolkit Boosted by Conformer.
CoRR
(2020)
Yi Luo
,
Zhuo Chen
,
Cong Han
,
Chenda Li
,
Tianyan Zhou
,
Nima Mesgarani
Rethinking the Separation Layers in Speech Separation Networks.
CoRR
(2020)
Chenda Li
,
Jing Shi
,
Wangyou Zhang
,
Aswin Shanmugam Subramanian
,
Xuankai Chang
,
Naoyuki Kamo
,
Moto Hira
,
Tomoki Hayashi
,
Christoph Böddeker
,
Zhuo Chen
,
Shinji Watanabe
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration.
CoRR
(2020)
Shinji Watanabe
,
Florian Boyer
,
Xuankai Chang
,
Pengcheng Guo
,
Tomoki Hayashi
,
Yosuke Higuchi
,
Takaaki Hori
,
Wen-Chin Huang
,
Hirofumi Inaguma
,
Naoyuki Kamo
,
Shigeki Karita
,
Chenda Li
,
Jing Shi
,
Aswin Shanmugam Subramanian
,
Wangyou Zhang
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans.
CoRR
(2020)
Cong Han
,
Yi Luo
,
Chenda Li
,
Tianyan Zhou
,
Keisuke Kinoshita
,
Shinji Watanabe
,
Marc Delcroix
,
Hakan Erdogan
,
John R. Hershey
,
Nima Mesgarani
,
Zhuo Chen
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording.
CoRR
(2020)
Chenda Li
,
Yanmin Qian
Prosody Usage Optimization for Children Speech Recognition with Zero Resource Children Speech.
INTERSPEECH
(2019)