​
Login / Signup
Sining Sun
ORCID
Publication Activity (10 Years)
Years Active: 2015-2024
Publications (10 Years): 42
Top Topics
Attention Mechanism
Wall Street Journal Corpus
Keyword Spotting
Speech Recognition
Top Venues
CoRR
INTERSPEECH
ICASSP
Interspeech
</>
Publications
</>
Zhaoxi Mu
,
Xinyu Yang
,
Sining Sun
,
Qing Yang
Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction.
AAAI
(2024)
Wenjing Zhu
,
Sining Sun
,
Changhao Shan
,
Peng Fan
,
Qing Yang
Skipformer: A Skip-and-Recover Strategy for Efficient Speech Recognition.
CoRR
(2024)
Peikun Chen
,
Sining Sun
,
Changhao Shan
,
Qing Yang
,
Lei Xie
Streaming Decoder-Only Automatic Speech Recognition with Discrete Speech Units: A Pilot Study.
CoRR
(2024)
Peng Fan
,
Changhao Shan
,
Sining Sun
,
Qing Yang
,
Jianwei Zhang
Key Frame Mechanism for Efficient Conformer Based End-to-End Speech Recognition.
IEEE Signal Process. Lett.
30 (2023)
Zhanheng Yang
,
Sining Sun
,
Xiong Wang
,
Yike Zhang
,
Long Ma
,
Lei Xie
Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer.
CoRR
(2023)
Peng Fan
,
Changhao Shan
,
Sining Sun
,
Qing Yang
,
Jianwei Zhang
Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition.
CoRR
(2023)
Shubo Lv
,
Xiong Wang
,
Sining Sun
,
Long Ma
,
Lei Xie
DCCRN-KWS: An Audio Bias Based Model for Noise Robust Small-Footprint Keyword Spotting.
INTERSPEECH
(2023)
Shubo Lv
,
Xiong Wang
,
Sining Sun
,
Long Ma
,
Lei Xie
DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting.
CoRR
(2023)
Zhanheng Yang
,
Sining Sun
,
Xiong Wang
,
Yike Zhang
,
Long Ma
,
Lei Xie
Two Stage Contextual Word Filtering for Context Bias in Unified Streaming and Non-streaming Transducer.
INTERSPEECH
(2023)
Zhaoxi Mu
,
Xinyu Yang
,
Sining Sun
,
Qing Yang
Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction.
CoRR
(2023)
Zhanheng Yang
,
Sining Sun
,
Jin Li
,
Xiaoming Zhang
,
Xiong Wang
,
Long Ma
,
Lei Xie
CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer.
CoRR
(2022)
Kun Wei
,
Yike Zhang
,
Sining Sun
,
Lei Xie
,
Long Ma
Conversational Speech Recognition by Learning Conversation-Level Characteristics.
ICASSP
(2022)
Kun Wei
,
Yike Zhang
,
Sining Sun
,
Lei Xie
,
Long Ma
Conversational Speech Recognition By Learning Conversation-level Characteristics.
CoRR
(2022)
Kun Wei
,
Yike Zhang
,
Sining Sun
,
Lei Xie
,
Long Ma
Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR.
INTERSPEECH
(2022)
Zhanheng Yang
,
Sining Sun
,
Jin Li
,
Xiaoming Zhang
,
Xiong Wang
,
Long Ma
,
Lei Xie
CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer.
INTERSPEECH
(2022)
Kun Wei
,
Yike Zhang
,
Sining Sun
,
Lei Xie
,
Long Ma
Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR.
CoRR
(2022)
Xiong Wang
,
Sining Sun
,
Lei Xie
,
Long Ma
Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition.
Interspeech
(2021)
Songjun Cao
,
Yueteng Kang
,
Yanzhe Fu
,
Xiaoshuo Xu
,
Sining Sun
,
Yike Zhang
,
Long Ma
Improving Streaming Transformer Based ASR Under a Framework of Self-supervised Learning.
CoRR
(2021)
Xiong Wang
,
Sining Sun
,
Lei Xie
,
Long Ma
Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-EndSpeech Recognition.
CoRR
(2021)
Yuekai Zhang
,
Sining Sun
,
Long Ma
Tiny Transducer: A Highly-efficient Speech Recognition Model on Edge Devices.
CoRR
(2021)
Songjun Cao
,
Yueteng Kang
,
Yanzhe Fu
,
Xiaoshuo Xu
,
Sining Sun
,
Yike Zhang
,
Long Ma
Improving Streaming Transformer Based ASR Under a Framework of Self-Supervised Learning.
Interspeech
(2021)
Yuekai Zhang
,
Sining Sun
,
Long Ma
Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices.
ICASSP
(2021)
Baiji Liu
,
Songjun Cao
,
Sining Sun
,
Weibin Zhang
,
Long Ma
Multi-head Monotonic Chunkwise Attention For Online Speech Recognition.
CoRR
(2020)
Qing Wang
,
Pengcheng Guo
,
Sining Sun
,
Lei Xie
,
John H. L. Hansen
Adversarial Regularization for End-to-End Robust Speaker Verification.
INTERSPEECH
(2019)
Pengcheng Guo
,
Sining Sun
,
Lei Xie
Unsupervised Adaptation with Adversarial Dropout Regularization for Robust Speech Recognition.
INTERSPEECH
(2019)
Xiong Wang
,
Sining Sun
,
Changhao Shan
,
Jingyong Hou
,
Lei Xie
,
Shen Li
,
Xin Lei
Adversarial Examples for Improving End-to-end Attention-based Small-footprint Keyword Spotting.
ICASSP
(2019)
Xiong Wang
,
Sining Sun
,
Lei Xie
Virtual Adversarial Training for DS-CNN Based Small-Footprint Keyword Spotting.
ASRU
(2019)
Sining Sun
,
Pengcheng Guo
,
Lei Xie
,
Mei-Yuh Hwang
Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process.
27 (11) (2019)
Sining Sun
,
Shuran Zhou
,
Mei-Yuh Hwang
,
Lei Xie
,
Qin Li
,
Xin Lei
Multiple fixed beamformers with a spacial Wiener-form postfilter for far-field speech recognition.
APSIPA
(2019)
Jingyong Hou
,
Pengcheng Guo
,
Sining Sun
,
Frank K. Soong
,
Wenping Hu
,
Lei Xie
Domain Adversarial Training for Improving Keyword Spotting Performance of ESL Speech.
ICASSP
(2019)
Xiang Hao
,
Changhao Shan
,
Yong Xu
,
Sining Sun
,
Lei Xie
An Attention-based Neural Network Approach for Single Channel Speech Enhancement.
ICASSP
(2019)
Suliang Bu
,
Yunxin Zhao
,
Mei-Yuh Hwang
,
Sining Sun
A Probability Weighted Beamformer for Noise Robust ASR.
INTERSPEECH
(2018)
Sining Sun
,
Ching-Feng Yeh
,
Mari Ostendorf
,
Mei-Yuh Hwang
,
Lei Xie
Training Augmentation with Adversarial Examples for Robust Speech Recognition.
INTERSPEECH
(2018)
Qing Wang
,
Wei Rao
,
Sining Sun
,
Lei Xie
,
Eng Siong Chng
,
Haizhou Li
Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition.
ICASSP
(2018)
Sining Sun
,
Ching-Feng Yeh
,
Mei-Yuh Hwang
,
Mari Ostendorf
,
Lei Xie
Domain Adversarial Training for Accented Speech Recognition.
CoRR
(2018)
Ke Wang
,
Junbo Zhang
,
Sining Sun
,
Yujun Wang
,
Fei Xiang
,
Lei Xie
Investigating Generative Adversarial Networks Based Speech Dereverberation for Robust Speech Recognition.
INTERSPEECH
(2018)
Sining Sun
,
Ching-Feng Yeh
,
Mei-Yuh Hwang
,
Mari Ostendorf
,
Lei Xie
Domain Adversarial Training for Accented Speech Recognition.
ICASSP
(2018)
Ke Wang
,
Junbo Zhang
,
Sining Sun
,
Yujun Wang
,
Fei Xiang
,
Lei Xie
Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition.
CoRR
(2018)
Suliang Bu
,
Yunxin Zhao
,
Mei-Yuh Hwang
,
Sining Sun
A Robust Nonlinear Microphone Array Postfilter for Noise Reduction.
IWAENC
(2018)
Sining Sun
,
Ching-Feng Yeh
,
Mari Ostendorf
,
Mei-Yuh Hwang
,
Lei Xie
Training Augmentation with Adversarial Examples for Robust Speech Recognition.
CoRR
(2018)
Sining Sun
,
Binbin Zhang
,
Lei Xie
,
Yanning Zhang
An unsupervised deep domain adaptation approach for robust speech recognition.
Neurocomputing
257 (2017)
Chenglin Xu
,
Xiong Xiao
,
Sining Sun
,
Wei Rao
,
Eng Siong Chng
,
Haizhou Li
Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source.
INTERSPEECH
(2017)
Jingyong Hou
,
Van Tung Pham
,
Cheung-Chi Leung
,
Lei Wang
,
Haihua Xu
,
Hang Lv
,
Lei Xie
,
Zhonghua Fu
,
Chongjia Ni
,
Xiong Xiao
,
Hongjie Chen
,
Shaofei Zhang
,
Sining Sun
,
Yougen Yuan
,
Pengcheng Li
,
Tin Lay Nwe
,
Sunil Sivadas
,
Bin Ma
,
Engsiong Chng
,
Haizhou Li
The NNI Query-by-Example System for MediaEval 2015.
MediaEval
(2015)