Ruoyu Wang
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 12
Top Topics
Quantum Computing
Speech Recognition
Speaker Diarization
Audio Features
Top Venues
CoRR
ICASSP
APSIPA ASC
INTERSPEECH
Publications
Liang Zou, Genwei Yan, Ruoyu Wang, Jun Du, Meng Lei, Tian Gao, Xin Fang
Multitask frame-level learning for few-shot sound event detection.
CoRR (2024)
Chang Li, Ruoyu Wang, Lijuan Liu, Jun Du, Yixuan Sun, Zilu Guo, Zhenrong Zhang, Yuan Jiang
Quality-aware Masked Diffusion Transformer for Enhanced Music Generation.
CoRR (2024)
Gaobin Yang, Maokui He, Shutong Niu, Ruoyu Wang, Yanyan Yue, Shuangqing Qian, Shilong Wu, Jun Du, Chin-Hui Lee
Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture.
ICASSP (2024)
Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Jiefeng Ma, Haotian Wang, Chin-Hui Lee
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition.
CoRR (2024)
Feng Ma, Yanhui Tu, Maokui He, Ruoyu Wang, Shutong Niu, Lei Sun, Zhongfu Ye, Jun Du, Jia Pan, Chin-Hui Lee
A Spatial Long-Term Iterative Mask Estimation Approach for Multi-Channel Speaker Diarization and Speech Recognition.
ICASSP (2024)
Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction.
ICASSP (2024)
Ruoyu Wang, Jun Du, Tian Gao
Quantum Transfer Learning Using the Large-Scale Unsupervised Pre-Trained Model Wavlm-Large for Synthetic Speech Detection.
ICASSP (2023)
Chang Wang, Jun Du, Hang Chen, Ruoyu Wang, Chao-Han Huck Yang, Jiangjiang Zhao, Yuling Ren, Qinglong Li, Chin-Hui Lee
Enhancing Privacy Preservation with Quantum Computing for Word-Level Audio-Visual Speech Recognition.
APSIPA ASC (2023)
Gaobin Yang, Maokui He, Shutong Niu, Ruoyu Wang, Yanyan Yue, Shuangqing Qian, Shilong Wu, Jun Du, Chin-Hui Lee
Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture.
CoRR (2023)
Shilong Wu, Chenxi Wang, Hang Chen, Yusheng Dai, Chenyue Zhang, Ruoyu Wang, Hongbo Lan, Jun Du, Chin-Hui Lee, Jingdong Chen, Shinji Watanabe, Sabato Marco Siniscalchi, Odette Scharenborg, Zhong-Qiu Wang, Jia Pan, Jianqing Gao
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction.
CoRR (2023)
Ruoyu Wang, Maokui He, Jun Du, Hengshun Zhou, Shutong Niu, Hang Chen, Yanyan Yue, Gaobin Yang, Shilong Wu, Lei Sun, Yanhui Tu, Haitao Tang, Shuangqing Qian, Tian Gao, Mengzhi Wang, Genshun Wan, Jia Pan, Jianqing Gao, Chin-Hui Lee
The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge.
CoRR (2023)
Guolong Zhong, Hongyu Song, Ruoyu Wang, Lei Sun, Diyuan Liu, Jia Pan, Xin Fang, Jun Du, Jie Zhang, Lirong Dai
External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge.
INTERSPEECH (2022)