Jie Zhang
Publication Activity (10 Years)
Years Active: 2014-2024
Publications (10 Years): 48
Top Topics
Noise Reduction
Speech Enhancement
Wiener Filter
Sensor Networks
Top Venues
IEEE ACM Trans. Audio Speech Lang. Process.
ICASSP
CoRR
INTERSPEECH
Publications
Shihao Chen
,
Liping Chen
,
Jie Zhang
,
Kong-Aik Lee
,
Zhenhua Ling
,
Lirong Dai
Adversarial Speech for Voice Privacy Protection from Personalized Speech Generation.
ICASSP
(2024)
Qiushi Zhu
,
Jie Zhang
,
Yu Gu
,
Yuchen Hu
,
Lirong Dai
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation.
AAAI
(2024)
Qing-Tian Xu
,
Jie Zhang
,
Zhen-Hua Ling
An End-to-End EEG Channel Selection Method with Residual Gumbel Softmax for Brain-Assisted Speech Enhancement.
ICASSP
(2024)
Qiushi Zhu
,
Long Zhou
,
Ziqiang Zhang
,
Shujie Liu
,
Binxing Jiao
,
Jie Zhang
,
Li-Rong Dai
,
Daxin Jiang
,
Jinyu Li
,
Furu Wei
VatLM: Visual-Audio-Text Pre-Training With Unified Masked Prediction for Speech Representation Learning.
IEEE Trans. Multim.
26 (2024)
Yichi Wang
,
Jie Zhang
,
Shihao Chen
,
Weitai Zhang
,
Zhongyi Ye
,
Xinyuan Zhou
,
Lirong Dai
A Study of Multichannel Spatiotemporal Features and Knowledge Distillation on Robust Target Speaker Extraction.
ICASSP
(2024)
Jianwei Cui
,
Yu Gu
,
Chao Weng
,
Jie Zhang
,
Liping Chen
,
Lirong Dai
SiFiSinger: A High-Fidelity End-to-End Singing Voice Synthesizer Based on Source-Filter Model.
ICASSP
(2024)
Guanghui Zhang
,
Jie Zhang
,
Yan Liu
,
Haibo Hu
,
Jack Y. B. Lee
,
Vaneet Aggarwal
Adaptive Video Streaming With Automatic Quality-of-Experience Optimization.
IEEE Trans. Mob. Comput.
22 (8) (2023)
Jie Zhang
,
Qing-Tian Xu
,
Qiu-Shi Zhu
,
Zhen-Hua Ling
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions.
INTERSPEECH
(2023)
Jie Zhang
,
Rui Tao
,
Li-Rong Dai
A Speech Distortion Weighted Single-Channel Wiener Filter Based STFT-Domain Noise Reduction.
SSP
(2023)
Guanghui Zhang
,
Jie Zhang
,
Ke Liu
,
Jing Guo
,
Jack Y. B. Lee
,
Haibo Hu
,
Vaneet Aggarwal
DUASVS: A Mobile Data Saving Strategy in Short-Form Video Streaming.
IEEE Trans. Serv. Comput.
16 (2) (2023)
Qiu-Shi Zhu
,
Jie Zhang
,
Zi-Qiang Zhang
,
Li-Rong Dai
A Joint Speech Enhancement and Self-Supervised Representation Learning Framework for Noise-Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Mohan Shi
,
Yuchun Shu
,
Lingyun Zuo
,
Qian Chen
,
Shiliang Zhang
,
Jie Zhang
,
Li-Rong Dai
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction.
INTERSPEECH
(2023)
Jie Zhang
,
Rui Tao
,
Jun Du
,
Li-Rong Dai
SDW-SWF: Speech Distortion Weighted Single-Channel Wiener Filter for Noise Reduction.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Bin Gu
,
Wu Guo
,
Jie Zhang
Memory Storable Network Based Feature Aggregation for Speaker Representation Learning.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Ye-Qian Du
,
Jie Zhang
,
Xin Fang
,
Ming-Hui Wu
,
Zhouwang Yang
A Semi-Supervised Complementary Joint Training Approach for Low-Resource Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Bin Gu
,
Jie Zhang
,
Wu Guo
A Dynamic Convolution Framework for Session-Independent Speaker Embedding Learning.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Mohan Shi
,
Zhihao Du
,
Qian Chen
,
Fan Yu
,
Yangze Li
,
Shiliang Zhang
,
Jie Zhang
,
Li-Rong Dai
CASA-ASR: Context-Aware Speaker-Attributed ASR.
INTERSPEECH
(2023)
Haotian Wang
,
Yuxuan Xi
,
Hang Chen
,
Jun Du
,
Yan Song
,
Qing Wang
,
Hengshun Zhou
,
Chenxi Wang
,
Jiefeng Ma
,
Pengfei Hu
,
Ya Jiang
,
Shi Cheng
,
Jie Zhang
,
Yuzhe Weng
Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023.
ACM Multimedia
(2023)
Jie Zhang
,
Rui Tao
,
Jun Du
,
Li-Rong Dai
Energy-Efficient Sparsity-Driven Speech Enhancement in Wireless Acoustic Sensor Networks.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Jingyuan Wang
,
Jie Zhang
,
Li-Rong Dai
Real-Time Causal Spectro-Temporal Voice Activity Detection Based on Convolutional Encoding and Residual Decoding.
INTERSPEECH
(2023)
Qiu-Shi Zhu
,
Jie Zhang
,
Zi-qiang Zhang
,
Ming-Hui Wu
,
Xin Fang
,
Li-Rong Dai
A Noise-Robust Self-Supervised Pre-Training Model Based Speech Representation Learning for Automatic Speech Recognition.
ICASSP
(2022)
Jie Zhang
,
Guanghui Zhang
,
Li-Rong Dai
Frequency-Invariant Sensor Selection for MVDR Beamforming in Wireless Acoustic Sensor Networks.
IEEE Trans. Wirel. Commun.
21 (12) (2022)
Ziqiang Zhang
,
Jie Zhang
,
Jian-Shu Zhang
,
Ming-Hui Wu
,
Xin Fang
,
Lirong Dai
Learning Contextually Fused Audio-Visual Representations For Audio-Visual Speech Recognition.
ICIP
(2022)
Xiao-Ying Zhao
,
Qiu-Shi Zhu
,
Jie Zhang
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization.
CoRR
(2022)
Qiu-Shi Zhu
,
Jie Zhang
,
Zi-qiang Zhang
,
Li-Rong Dai
Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR.
CoRR
(2022)
Ye-Qian Du
,
Jie Zhang
,
Qiu-Shi Zhu
,
Li-Rong Dai
,
Ming-Hui Wu
,
Xin Fang
,
Zhou-Wang Yang
A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition.
CoRR
(2022)
Xing-Yu Chen
,
Qiu-Shi Zhu
,
Jie Zhang
,
Li-Rong Dai
Supervised and Self-Supervised Pretraining Based Covid-19 Detection Using Acoustic Breathing/Cough/Speech Signals.
ICASSP
(2022)
Xing-Yu Chen
,
Jie Zhang
,
Li-Rong Dai
Reference Microphone Selection and Low-Rank Approximation Based Multichannel Wiener Filter with Application to Speech Recognition.
ICASSP
(2022)
Jie Zhang
,
Guanghui Zhang
A Parametric Unconstrained Beamformer Based Binaural Noise Reduction for Assistive Hearing.
IEEE ACM Trans. Audio Speech Lang. Process.
30 (2022)
Qiu-Shi Zhu
,
Jie Zhang
,
Ming-Hui Wu
,
Xin Fang
,
Li-Rong Dai
An Improved Wav2Vec 2.0 Pre-Training Approach Using Enhanced Local Dependency Modeling for Speech Recognition.
INTERSPEECH
(2021)
Jie Zhang
,
Huawei Chen
,
Li-Rong Dai
,
Richard Christian Hendriks
A Study on Reference Microphone Selection for Multi-Microphone Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Jie Zhang
Power Optimized and Power Constrained Randomized Gossip Approaches for Wireless Sensor Networks.
IEEE Wirel. Commun. Lett.
10 (2) (2021)
Jie Zhang
,
Changheng Li
Quantization-Aware Binaural MWF Based Noise Reduction Incorporating External Wireless Devices.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Jian Tang
,
Jie Zhang
,
Yan Song
,
Ian McLoughlin
,
Li-Rong Dai
Multi-Granularity Sequence Alignment Mapping for Encoder-Decoder Based End-to-End ASR.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Jie Zhang
,
Jun Du
,
Li-Rong Dai
Sensor Selection for Relative Acoustic Transfer Function Steered Linearly-Constrained Beamformers.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Jie Zhang
,
Pingping Wu
Joint Sampling Synchronization and Source Localization for Wireless Acoustic Sensor Networks.
IEEE Commun. Lett.
24 (5) (2020)
Jie Zhang
,
Andreas I. Koutrouvelis
,
Richard Heusdens
,
Richard C. Hendriks
Distributed Rate-Constrained LCMV Beamforming.
IEEE Signal Process. Lett.
26 (5) (2019)
Jie Zhang
,
Richard Heusdens
,
Richard C. Hendriks
Sensor Selection and Rate Distribution Based Beamforming in Wireless Acoustic Sensor Networks.
EUSIPCO
(2019)
Jie Zhang
,
Richard Heusdens
,
Richard Christian Hendriks
Relative Acoustic Transfer Function Estimation in Wireless Acoustic Sensor Networks.
IEEE ACM Trans. Audio Speech Lang. Process.
27 (10) (2019)
Jie Zhang
,
Sundeep Prabhakar Chepuri
,
Richard Christian Hendriks
,
Richard Heusdens
Microphone Subset Selection for MVDR Beamformer Based Noise Reduction.
IEEE ACM Trans. Audio Speech Lang. Process.
26 (3) (2018)
Jie Zhang
,
Richard Heusdens
,
Richard C. Hendriks
Rate-Distributed Binaural LCMV Beamforming for Assistive Hearing in Wireless Acoustic Sensor Networks.
SAM
(2018)
Jie Zhang
,
Richard Heusdens
,
Richard Christian Hendriks
Rate-Distributed Spatial Filtering Based Noise Reduction in Wireless Acoustic Sensor Networks.
IEEE ACM Trans. Audio Speech Lang. Process.
26 (11) (2018)
Jie Zhang
,
Richard Heusdens
,
Richard C. Hendriks
Rate-Distributed Spatial Filtering Based Noise Reduction in Wireless Acoustic Sensor Networks.
CoRR
(2017)
Cheng Pang
,
Hong Liu
,
Jie Zhang
,
Xiaofei Li
Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping.
IEEE ACM Trans. Audio Speech Lang. Process.
25 (8) (2017)
Jie Zhang
,
Sundeep Prabhakar Chepuri
,
Richard C. Hendriks
,
Richard Heusdens
Microphone Subset Selection for MVDR Beamformer Based Noise Reduction.
CoRR
(2017)
Jie Zhang
,
Richard C. Hendriks
,
Richard Heusdens
Structured total least squares based internal delay estimation for distributed microphone auto-localization.
IWAENC
(2016)
Hong Liu
,
Mengdi Yue
,
Jie Zhang
Probabilistic binaural multiple sources localization based on time-delay compensation estimator and clustering analysis.
IROS
(2016)
Hong Liu
,
Mengdi Yue
,
Jie Zhang
Bi-Direction Interaural Matching Filter and Decision Weighting Fusion for Sound Source Localization in Noisy Environments.
IEICE Trans. Inf. Syst.
(12) (2016)
Ling Chen
,
Jie Zhang
,
Guodong Chen
,
Meng Zhang
,
Hong Liu
Binaural cues estimates based on Interaural Matching Filter for sound source localization.
ROBIO
(2015)
Cheng Pang
,
Jie Zhang
,
Hong Liu
Direction of arrival estimation based on reverberation weighting and noise error estimator.
INTERSPEECH
(2015)
Jie Zhang
,
Hong Liu
Robust Acoustic Localization Via Time-Delay Compensation and Interaural Matching Filter.
IEEE Trans. Signal Process.
63 (18) (2015)
Hong Liu
,
Cheng Pang
,
Jie Zhang
Binaural sound source localization based on generalized parametric model and two-layer matching strategy in complex environments.
ICRA
(2015)
Hong Liu
,
Jie Zhang
A binaural sound source localization model based on time-delay compensation and interaural coherence.
ICASSP
(2014)
Hong Liu
,
Jie Zhang
,
Zhuo Fu
A new hierarchical binaural sound source localization method based on Interaural Matching Filter.
ICRA
(2014)
Mengdi Yue
,
Ling Chen
,
Jie Zhang
,
Hong Liu
Speaker age recognition based on isolated words by using SVM.
CCIS
(2014)