Login / Signup
Hui Zhang
Publication Activity (10 Years)
Years Active: 2014-2024
Publications (10 Years): 33
Top Topics
Semantic Representations
Keyword Spotting
Document Images
Speech Enhancement
Top Venues
ICASSP
INTERSPEECH
CoRR
IALP
</>
Publications
</>
Jiahui Pan
,
Pengjie Shen
,
Hui Zhang
,
Xueliang Zhang
Efficient Multi-Channel Speech Enhancement with Spherical Harmonics Injection for Directional Encoding.
ICASSP
(2024)
Min Lu
,
Feilong Bao
,
Hui Zhang
,
Guanglai Gao
The image and ground truth dataset of Mongolian movable-type newspapers for text recognition.
Int. J. Document Anal. Recognit.
27 (2) (2024)
Tianci Wu
,
Shulin He
,
Hui Zhang
,
Xueliang Zhang
ScaleFormer: Transformer-based speech enhancement in the multi-scale time domain.
APSIPA ASC
(2023)
Jiahui Pan
,
Shuai Nie
,
Hui Zhang
,
Shulin He
,
Kanghao Zhang
,
Shan Liang
,
Xueliang Zhang
,
Jianhua Tao
Speaker recognition-assisted robust audio deepfake detection.
INTERSPEECH
(2022)
Yonghe Wang
,
Rui Liu
,
Feilong Bao
,
Hui Zhang
,
Guanglai Gao
Alignment-Learning Based Single-Step Decoding for Accurate and Fast Non-Autoregressive Speech Recognition.
ICASSP
(2022)
Yang Yang
,
Hui Zhang
,
Xueliang Zhang
,
Huaiwen Zhang
Alleviating the Loss-Metric Mismatch in Supervised Single-Channel Speech Enhancement.
ICASSP
(2022)
Yihao Wu
,
Yonghe Wang
,
Hui Zhang
,
Feilong Bao
,
Guanglai Gao
MNASR: A Free Speech Corpus For Mongolian Speech Recognition And Accompanied Baselines.
O-COCOSDA 2022
(2022)
Yonghe Wang
,
Hui Zhang
,
Feilong Bao
,
Guanglai Gao
Soft-BAC: Soft Bidirectional Alignment Cost for End-to-End Automatic Speech Recognition.
PRICAI (2)
(2021)
Xiang Hao
,
Xiangdong Su
,
Zhiyu Wang
,
Hui Zhang
,
Batushiren
UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition.
CoRR
(2020)
Hongxi Wei
,
Cong Liu
,
Hui Zhang
,
Feilong Bao
,
Guanglai Gao
End-to-End Model for Offline Handwritten Mongolian Word Recognition.
NLPCC (2)
(2019)
Xiang Hao
,
Xiangdong Su
,
Zhiyu Wang
,
Hui Zhang
,
Batushiren
UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-Noise Ratio Condition.
INTERSPEECH
(2019)
Min Lu
,
Feilong Bao
,
Guanglai Gao
,
Weihua Wang
,
Hui Zhang
An Automatic Spelling Correction Method for Classical Mongolian.
KSEM (2)
(2019)
Yun Liu
,
Hui Zhang
,
Xueliang Zhang
,
Linju Yang
Supervised Speech Enhancement with Real Spectrum Approximation.
ICASSP
(2019)
Yun Liu
,
Hui Zhang
,
Xueliang Zhang
Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation.
INTERSPEECH
(2018)
Rui Liu
,
Feilong Bao
,
Guanglai Gao
,
Hui Zhang
,
Yonghe Wang
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
INTERSPEECH
(2018)
Hongxi Wei
,
Hui Zhang
,
Guanglai Gao
Word Image Representation Based on Visual Embeddings and Spatial Constraints for Keyword Spotting on Historical Documents.
ICPR
(2018)
Hui Zhang
,
Xueliang Zhang
,
Guanglai Gao
Training Supervised Speech Separation System to Improve STOI and PESQ Directly.
ICASSP
(2018)
Jingdong Li
,
Hui Zhang
,
Rui Liu
,
Xueliang Zhang
,
Feilong Bao
End-to-End Mongolian Text-to-Speech System.
ISCSLP
(2018)
Rui Liu
,
Feilong Bao
,
Guanglai Gao
,
Hui Zhang
,
Yonghe Wang
A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction.
COLING
(2018)
Rui Liu
,
Feilong Bao
,
Guanglai Gao
,
Hui Zhang
,
Yonghe Wang
Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.
PRICAI (1)
(2018)
Hongxi Wei
,
Hui Zhang
,
Guanglai Gao
Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images.
PCM (2)
(2017)
Hui Zhang
,
Xueliang Zhang
,
Guanglai Gao
Multi-Target Ensemble Learning for Monaural Speech Separation.
INTERSPEECH
(2017)
Hui Zhang
,
Hongxi Wei
,
Feilong Bao
,
Guanglai Gao
Segmentation-Free Printed Traditional Mongolian OCR Using Sequence to Sequence with Attention Model.
ICDAR
(2017)
Hao Li
,
Xueliang Zhang
,
Hui Zhang
,
Guanglai Gao
Integrated Speech Enhancement Method Based on Weighted Prediction Error and DNN for Dereverberation and Denoising.
CoRR
(2017)
Hongxi Wei
,
Hui Zhang
,
Guanglai Gao
Representing word image using visual word embeddings and RNN for keyword spotting on historical document images.
ICME
(2017)
Hongxi Wei
,
Hui Zhang
,
Guanglai Gao
,
Xiangdong Su
Using Word Mover's Distance with Spatial Constraints for Measuring Similarity Between Mongolian Word Images.
ICONIP (4)
(2017)
Hongwei Zhang
,
Feilong Bao
,
Guanglai Gao
,
Hui Zhang
Comparison on Neural Network based acoustic model in Mongolian speech recognition.
IALP
(2016)
Xueliang Zhang
,
Hui Zhang
,
Shuai Nie
,
Guanglai Gao
,
Wenju Liu
A Pairwise Algorithm Using the Deep Stacking Network for Speech Separation and Pitch Estimation.
IEEE ACM Trans. Audio Speech Lang. Process.
24 (6) (2016)
Hong Su
,
Hui Zhang
,
Xueliang Zhang
,
Guanglai Gao
Convolutional neural network for robust pitch determination.
ICASSP
(2016)
Hao Li
,
Shuai Nie
,
Xueliang Zhang
,
Hui Zhang
Jointly Optimizing Activation Coefficients of Convolutive NMF Using DNN for Speech Separation.
INTERSPEECH
(2016)
Hui Zhang
,
Xueliang Zhang
,
Guanglai Gao
Document summarization based on semantic representations.
IALP
(2015)
Hui Zhang
,
Feilong Bao
,
Guanglai Gao
Mongolian Speech Recognition Based on Deep Neural Networks.
CCL
(2015)
Hui Zhang
,
Xueliang Zhang
,
Shuai Nie
,
Guanglai Gao
,
Wenju Liu
A pairwise algorithm for pitch estimation and speech separation using deep stacking network.
ICASSP
(2015)
Xueliang Zhang
,
Hui Zhang
,
Guanglai Gao
Missing feature reconstruction methods for robust speaker identification.
EUSIPCO
(2014)
Shuai Nie
,
Hui Zhang
,
Xueliang Zhang
,
Wenju Liu
Deep stacking networks with time series for speech separation.
ICASSP
(2014)