​
Login / Signup
Buye Xu
ORCID
Publication Activity (10 Years)
Years Active: 2012-2024
Publications (10 Years): 46
Top Topics
Smoothing Algorithm
Neural Network
Noise Reduction
Speech Enhancement
Top Venues
CoRR
ICASSP
INTERSPEECH
WASPAA
</>
Publications
</>
Tsun-An Hsieh
,
Jacob Donley
,
Daniel Wong
,
Buye Xu
,
Ashutosh Pandey
On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement.
ICASSP
(2024)
Ravi Shankar
,
Ke Tan
,
Buye Xu
,
Anurag Kumar
A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement.
CoRR
(2024)
Tsun-An Hsieh
,
Jacob Donley
,
Daniel Wong
,
Buye Xu
,
Ashutosh Pandey
On the Importance of Neural Wiener Filter for Resource Efficient Multichannel Speech Enhancement.
CoRR
(2024)
Vahid Ahmadi Kalkhorani
,
Anurag Kumar
,
Ke Tan
,
Buye Xu
,
DeLiang Wang
Audiovisual Speaker Separation with Full- and Sub-Band Modeling in the Time-Frequency Domain.
ICASSP
(2024)
Ravi Shankar
,
Ke Tan
,
Buye Xu
,
Anurag Kumar
A Closer Look at Wav2vec2 Embeddings for On-Device Single-Channel Speech Enhancement.
ICASSP
(2024)
Ashutosh Pandey
,
Buye Xu
Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement.
CoRR
(2024)
Vahid Ahmadi Kalkhorani
,
Cheng Yu
,
Anurag Kumar
,
Ke Tan
,
Buye Xu
,
DeLiang Wang
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling.
CoRR
(2024)
Hassan Taherian
,
Ashutosh Pandey
,
Daniel Wong
,
Buye Xu
,
DeLiang Wang
Leveraging Sound Localization to Improve Continuous Speaker Separation.
ICASSP
(2024)
Ashutosh Pandey
,
Sanha Lee
,
Juan Azcarreta
,
Daniel Wong
,
Buye Xu
All Neural Low-latency Directional Speech Extraction.
CoRR
(2024)
Ashutosh Pandey
,
Buye Xu
Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement.
ICASSP
(2024)
Haibin Wu
,
Ke Tan
,
Buye Xu
,
Anurag Kumar
,
Daniel Wong
Rethinking complex-valued deep neural networks for monaural speech enhancement.
CoRR
(2023)
Anurag Kumar
,
Ke Tan
,
Zhaoheng Ni
,
Pranay Manocha
,
Xiaohui Zhang
,
Ethan Henderson
,
Buye Xu
Torchaudio-Squim: Reference-Less Speech Quality and Intelligibility Measures in Torchaudio.
ICASSP
(2023)
Ashutosh Pandey
,
Ke Tan
,
Buye Xu
A Simple RNN Model for Lightweight, Low-compute and Low-latency Multichannel Speech Enhancement in the Time Domain.
INTERSPEECH
(2023)
Haibin Wu
,
Ke Tan
,
Buye Xu
,
Anurag Kumar
,
Daniel Wong
Rethinking Complex-Valued Deep Neural Networks for Monaural Speech Enhancement.
INTERSPEECH
(2023)
Vahid Ahmadi Kalkhorani
,
Anurag Kumar
,
Ke Tan
,
Buye Xu
,
DeLiang Wang
Time-domain Transformer-based Audiovisual Speaker Separation.
INTERSPEECH
(2023)
Rodrigo Mira
,
Buye Xu
,
Jacob Donley
,
Anurag Kumar
,
Stavros Petridis
,
Vamsi Krishna Ithapu
,
Maja Pantic
LA-VOCE: LOW-SNR Audio-Visual Speech Enhancement Using Neural Vocoders.
ICASSP
(2023)
Kuan-Lin Chen
,
Daniel D. E. Wong
,
Ke Tan
,
Buye Xu
,
Anurag Kumar
,
Vamsi Krishna Ithapu
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-Channel Speech Enhancement.
ICASSP
(2023)
Hassan Taherian
,
Ashutosh Pandey
,
Daniel Wong
,
Buye Xu
,
DeLiang Wang
Multi-input Multi-output Complex Spectral Mapping for Speaker Separation.
INTERSPEECH
(2023)
Rodrigo Mira
,
Buye Xu
,
Jacob Donley
,
Anurag Kumar
,
Stavros Petridis
,
Vamsi Krishna Ithapu
,
Maja Pantic
LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders.
CoRR
(2022)
Ashutosh Pandey
,
Buye Xu
,
Anurag Kumar
,
Jacob Donley
,
Paul Calamia
,
DeLiang Wang
Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network.
INTERSPEECH
(2022)
Ashutosh Pandey
,
Buye Xu
,
Anurag Kumar
,
Jacob Donley
,
Paul Calamia
,
DeLiang Wang
TPARN: Triple-Path Attentive Recurrent Network for Time-Domain Multichannel Speech Enhancement.
ICASSP
(2022)
Efthymios Tzinis
,
Yossi Adi
,
Vamsi K. Ithapu
,
Buye Xu
,
Anurag Kumar
Continual Self-Training With Bootstrapped Remixing For Speech Enhancement.
ICASSP
(2022)
Efthymios Tzinis
,
Yossi Adi
,
Vamsi Krishna Ithapu
,
Buye Xu
,
Paris Smaragdis
,
Anurag Kumar
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing.
CoRR
(2022)
Kuan-Lin Chen
,
Daniel D. E. Wong
,
Ke Tan
,
Buye Xu
,
Anurag Kumar
,
Vamsi Krishna Ithapu
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement.
CoRR
(2022)
Pranay Manocha
,
Anurag Kumar
,
Buye Xu
,
Anjali Menon
,
Israel Dejene Gebru
,
Vamsi Krishna Ithapu
,
Paul Calamia
SAQAM: Spatial Audio Quality Assessment Metric.
INTERSPEECH
(2022)
Efthymios Tzinis
,
Yossi Adi
,
Vamsi K. Ithapu
,
Buye Xu
,
Paris Smaragdis
,
Anurag Kumar
RemixIT: Continual Self-Training of Speech Enhancement Models via Bootstrapped Remixing.
IEEE J. Sel. Top. Signal Process.
16 (6) (2022)
Ashutosh Pandey
,
Buye Xu
,
Anurag Kumar
,
Jacob Donley
,
Paul Calamia
,
DeLiang Wang
Multichannel Speech Enhancement Without Beamforming.
ICASSP
(2022)
Tong Xiao
,
Buye Xu
,
Chuming Zhao
Spatially Selective Active Noise Control Systems.
CoRR
(2022)
Pranay Manocha
,
Anurag Kumar
,
Buye Xu
,
Anjali Menon
,
Israel D. Gebru
,
Vamsi K. Ithapu
,
Paul Calamia
SAQAM: Spatial Audio Quality Assessment Metric.
CoRR
(2022)
Yangyang Xia
,
Buye Xu
,
Anurag Kumar
Incorporating Real-World Noisy Speech in Neural-Network-Based Speech Enhancement Systems.
ASRU
(2021)
Ke Tan
,
Buye Xu
,
Anurag Kumar
,
Eliya Nachmani
,
Yossi Adi
SAGRNN: Self-Attentive Gated RNN For Binaural Speaker Separation With Interaural Cue Preservation.
IEEE Signal Process. Lett.
28 (2021)
Ashutosh Pandey
,
Buye Xu
,
Anurag Kumar
,
Jacob Donley
,
Paul Calamia
,
DeLiang Wang
TPARN: Triple-path Attentive Recurrent Network for Time-domain Multichannel Speech Enhancement.
CoRR
(2021)
Ashutosh Pandey
,
Buye Xu
,
Anurag Kumar
,
Jacob Donley
,
Paul Calamia
,
DeLiang Wang
TADRN: Triple-Attentive Dual-Recurrent Network for Ad-hoc Array Multichannel Speech Enhancement.
CoRR
(2021)
Jonah Casebeer
,
Jacob Donley
,
Daniel Wong
,
Buye Xu
,
Anurag Kumar
NICE-Beam: Neural Integrated Covariance Estimators for Time-Varying Beamformers.
CoRR
(2021)
Pranay Manocha
,
Anurag Kumar
,
Buye Xu
,
Anjali Menon
,
Israel D. Gebru
,
Vamsi K. Ithapu
,
Paul Calamia
DPLM: A Deep Perceptual Spatial-Audio Localization Metric.
CoRR
(2021)
Efthymios Tzinis
,
Yossi Adi
,
Vamsi K. Ithapu
,
Buye Xu
,
Anurag Kumar
Continual self-training with bootstrapped remixing for speech enhancement.
CoRR
(2021)
Yangyang Xia
,
Buye Xu
,
Anurag Kumar
Incorporating Real-world Noisy Speech in Neural-network-based Speech Enhancement Systems.
CoRR
(2021)
Asutosh Pandey
,
Buye Xu
,
Anurag Kumar
,
Jacob Donley
,
Paul Calamia
,
DeLiang Wang
Multichannel Speech Enhancement without Beamforming.
CoRR
(2021)
Pranay Manocha
,
Buye Xu
,
Anurag Kumar
NORESQA: A Framework for Speech Quality Assessment using Non-Matching References.
NeurIPS
(2021)
Pranay Manocha
,
Buye Xu
,
Anurag Kumar
NORESQA - A Framework for Speech Quality Assessment using Non-Matching References.
CoRR
(2021)
Ori Kabeli
,
Yossi Adi
,
Zhenyu Tang
,
Buye Xu
,
Anurag Kumar
Online Self-Attentive Gated RNNs for Real-Time Speaker Separation.
CoRR
(2021)
Pranay Manocha
,
Anurag Kumar
,
Buye Xu
,
Anjali Menon
,
Israel D. Gebru
,
Vamsi K. Ithapu
,
Paul Calamia
DPLM: A Deep Perceptual Spatial-Audio Localization Metric.
WASPAA
(2021)
Yan Zhao
,
DeLiang Wang
,
Buye Xu
,
Tao Zhang
Monaural Speech Dereverberation Using Temporal Convolutional Networks With Self Attention.
IEEE ACM Trans. Audio Speech Lang. Process.
28 (2020)
Ke Tan
,
Buye Xu
,
Anurag Kumar
,
Eliya Nachmani
,
Yossi Adi
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation.
CoRR
(2020)
Yan Zhao
,
Buye Xu
,
Ritwik Giri
,
Tao Zhang
Perceptually Guided Speech Enhancement Using Deep Neural Networks.
ICASSP
(2018)
Yan Zhao
,
DeLiang Wang
,
Buye Xu
,
Tao Zhang
Late Reverberation Suppression Using Recurrent Neural Networks with Long Short-Term Memory.
ICASSP
(2018)
William S. Woods
,
Elior Hadad
,
Ivo Merks
,
Buye Xu
,
Sharon Gannot
,
Tao Zhang
A real-world recording database for ad hoc microphone arrays.
WASPAA
(2015)
Ivo Merks
,
Buye Xu
,
Tao Zhang
Design of a high order binaural microphone array for hearing aids using a rigid spherical model.
ICASSP
(2014)
Eric A. Durant
,
Jinjun Xiao
,
Buye Xu
,
Martin F. McKinney
,
Tao Zhang
Perceptually motivated ANC for hearing-impaired listeners.
WASPAA
(2013)
Srikanth Vishnubhotla
,
Jinjun Xiao
,
Buye Xu
,
Martin F. McKinney
,
Tao Zhang
Annoyance perception and modeling for hearing-impaired listeners.
ICASSP
(2012)