Qing Wang
Publication Activity (10 Years)
Years Active: 2014-2024
Publications (10 Years): 31
Top Topics
Speech Enhancement
DOA Estimation
Information Fusion
Neural Network
Top Venues
CoRR
ISCSLP
ICASSP
IEEE ACM Trans. Audio Speech Lang. Process.
Publications
Jiefeng Ma, Yan Wang, Chenyu Liu, Jun Du, Yu Hu, Zhenrong Zhang, Pengfei Hu, Qing Wang, Jianshu Zhang
SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding.
CoRR
(2024)
Hang Chen, Qing Wang, Jun Du, Bao-Cai Yin, Jia Pan, Chin-Hui Lee
Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Hang Chen, Qing Wang, Jun Du, Genshun Wan, Shifu Xiong, Baocai Yin, Jia Pan, Chin-Hui Lee
Collaborative Viseme Subword and End-to-End Modeling for Word-Level Lip Reading.
IEEE Trans. Multim.
26 (2024)
Zilu Guo, Qing Wang, Jun Du, Jia Pan, Qing-Feng Liu, Chin-Hui Lee
A Variance-Preserving Interpolation Approach for Diffusion Models With Applications to Single Channel Speech Enhancement and Recognition.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Shutong Niu, Jun Du, Qing Wang, Li Chai, Huaxin Wu, Zhaoxu Nian, Lei Sun, Yi Fang, Jia Pan, Chin-Hui Lee
An Experimental Study on Sound Event Localization and Detection Under Realistic Testing Conditions.
ICASSP
(2023)
Shi Cheng, Jun Du, Qing Wang, Ya Jiang, Zhaoxu Nian, Shutong Niu, Chin-Hui Lee, Yu Gao, Wenbin Zhang
Improving Sound Event Localization and Detection with Class-Dependent Sound Separation for Real-World Scenarios.
APSIPA ASC
(2023)
Qing Wang, Jun Du, Huaxin Wu, Jia Pan, Feng Ma, Chin-Hui Lee
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Ya Jiang, Hang Chen, Jun Du, Qing Wang, Chin-Hui Lee
Incorporating Lip Features into Audio-Visual Multi-Speaker DOA Estimation by Gated Fusion.
ICASSP
(2023)
Haotian Wang, Yuxuan Xi, Hang Chen, Jun Du, Yan Song, Qing Wang, Hengshun Zhou, Chenxi Wang, Jiefeng Ma, Pengfei Hu, Ya Jiang, Shi Cheng, Jie Zhang, Yuzhe Weng
Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023.
ACM Multimedia
(2023)
Qing Wang, Jun Du, Zhaoxu Nian, Shutong Niu, Li Chai, Huaxin Wu, Jia Pan, Chin-Hui Lee
Loss Function Design for DNN-Based Sound Event Localization and Detection on Low-Resource Realistic Data.
ICASSP
(2023)
Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee
A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification.
ISCSLP
(2022)
Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function.
CoRR
(2022)
Qing Wang, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function.
ISCSLP
(2022)
Yajian Wang, Jun Du, Hang Chen, Qing Wang, Chin-Hui Lee
Deep Segment Model for Acoustic Scene Classification.
INTERSPEECH
(2022)
Qing Wang, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee
A study on joint modeling and data augmentation of multi-modalities for audio-visual scene classification.
CoRR
(2022)
Qing Wang, Jun Du, Huaxin Wu, Jia Pan, Feng Ma, Chin-Hui Lee
A Four-Stage Data Augmentation Approach to ResNet-Conformer Based Acoustic Modeling for Sound Event Localization and Detection.
CoRR
(2021)
Qing Wang, Huaxin Wu, Zijun Jing, Feng Ma, Yi Fang, Yuxuan Wang, Tairan Chen, Jia Pan, Jun Du, Chin-Hui Lee
A Model Ensemble Approach for Sound Event Localization and Detection.
ISCSLP
(2021)
Koen Oostermeijer, Jun Du, Qing Wang, Chin-Hui Lee
Speech Enhancement Autoencoder with Hierarchical Latent Structure.
ICASSP
(2021)
Hengshun Zhou, Jun Du, Yuanyuan Zhang, Qing Wang, Qing-Feng Liu, Chin-Hui Lee
Information Fusion in Attention Networks Using Adaptive and Multi-Level Factorized Bilinear Pooling for Audio-Visual Emotion Recognition.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Chao-Han Huck Yang, Hu Hu, Sabato Marco Siniscalchi, Qing Wang, Yuyang Wang, Xianjun Xia, Yuanjun Zhao, Yuzhong Wu, Yannan Wang, Jun Du, Chin-Hui Lee
A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification.
CoRR
(2021)
Koen Oostermeijer, Qing Wang, Jun Du
Lightweight Causal Transformer with Local Self-Attention for Real-Time Speech Enhancement.
INTERSPEECH
(2021)
Koen Oostermeijer, Qing Wang, Jun Du
Frequency Gating: Improved Convolutional Neural Networks for Speech Enhancement in the Time-Frequency Domain.
APSIPA
(2020)
Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee
Geometry Constrained Progressive Learning for LSTM-Based Speech Enhancement.
ICASSP
(2020)
Koen Oostermeijer, Qing Wang, Jun Du
Frequency Gating: Improved Convolutional Neural Networks for Speech Enhancement in the Time-Frequency Domain.
CoRR
(2020)
Xin Tang, Jun Du, Li Chai, Yannan Wang, Qing Wang, Chin-Hui Lee
A LSTM-Based Joint Progressive Learning Framework for Simultaneous Speech Dereverberation and Denoising.
APSIPA
(2019)
Qing Wang, Jun Du, Li Chai, Li-Rong Dai, Chin-Hui Lee
A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network.
ISCSLP
(2018)
Xin Wang, Jun Du, Lei Sun, Qing Wang, Chin-Hui Lee
A Progressive Deep Learning Approach to Child Speech Separation.
ISCSLP
(2018)
Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee
A Multiobjective Learning and Ensembling Approach to High-Performance Speech Enhancement With Compact Neural Network Architectures.
IEEE ACM Trans. Audio Speech Lang. Process.
26 (7) (2018)
Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee
Joint noise and mask aware training for DNN-based speech enhancement with SUB-band features.
HSCMA
(2017)
Yanhui Tu, Jun Du, Qing Wang, Xiao Bao, Li-Rong Dai, Chin-Hui Lee
An information fusion framework with multi-channel feature concatenation and multi-perspective system combination for the deep-learning-based robust recognition of microphone array speech.
Comput. Speech Lang.
46 (2017)
Qing Wang, Jun Du, Li-Rong Dai
Boosting DNN-based speech enhancement via explicit transformations.
APSIPA
(2016)
Jun Du, Qing Wang, Yanhui Tu, Xiao Bao, Li-Rong Dai, Chin-Hui Lee
An information fusion approach to recognizing microphone array speech in the CHiME-3 challenge based on a deep learning framework.
ASRU
(2015)
Qing Wang, Jun Du, Xiao Bao, Zi-Rui Wang, Li-Rong Dai, Chin-Hui Lee
A universal VAD based on jointly trained deep neural networks.
INTERSPEECH
(2015)
Jun Du, Qing Wang, Tian Gao, Yong Xu, Li-Rong Dai, Chin-Hui Lee
Robust speech recognition with speech enhanced deep neural networks.
INTERSPEECH
(2014)