Login / Signup
Shuai Zhang
ORCID
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 30
Top Topics
Language Model
Adaboost Classifier
Speech Recognition
Autoregressive
Top Venues
CoRR
INTERSPEECH
ICASSP
Interspeech
</>
Publications
</>
Xinxin Zheng
,
Feihu Che
,
Jinyang Wu
,
Shuai Zhang
,
Shuai Nie
,
Kang Liu
,
Jianhua Tao
KS-LLM: Knowledge Selection of Large Language Models with Evidence Document for Question Answering.
CoRR
(2024)
Ruibo Fu
,
Xin Qi
,
Zhengqi Wen
,
Jianhua Tao
,
Tao Wang
,
Chunyu Qiang
,
Zhiyong Wang
,
Yi Lu
,
Xiaopeng Wang
,
Shuchen Shi
,
Yukun Liu
,
Xuefei Liu
,
Shuai Zhang
ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation.
CoRR
(2024)
Ruibo Fu
,
Shuchen Shi
,
Hongming Guo
,
Tao Wang
,
Chunyu Qiang
,
Zhengqi Wen
,
Jianhua Tao
,
Xin Qi
,
Yi Lu
,
Xiaopeng Wang
,
Zhiyong Wang
,
Yukun Liu
,
Xuefei Liu
,
Shuai Zhang
,
Guanjun Li
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation.
CoRR
(2024)
Ruihan Jin
,
Ruibo Fu
,
Zhengqi Wen
,
Shuai Zhang
,
Yukun Liu
,
Jianhua Tao
Fake News Detection and Manipulation Reasoning via Large Vision-Language Models.
CoRR
(2024)
Chenglong Wang
,
Jiangyan Yi
,
Jianhua Tao
,
Chu Yuan Zhang
,
Shuai Zhang
,
Xun Chen
Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features.
INTERSPEECH
(2023)
Chenglong Wang
,
Jiangyan Yi
,
Jianhua Tao
,
Chu Yuan Zhang
,
Shuai Zhang
,
Ruibo Fu
,
Xun Chen
TO-Rawnet: Improving RawNet with TCN and Orthogonal Regularization for Fake Audio Detection.
INTERSPEECH
(2023)
Shuai Zhang
,
Jiangyan Yi
,
Zhengkun Tian
,
Jianhua Tao
,
Yu Ting Yeung
,
Liqun Deng
Reducing language context confusion for end-to-end code-switching automatic speech recognition.
CoRR
(2022)
Zhengkun Tian
,
Jiangyan Yi
,
Jianhua Tao
,
Shuai Zhang
,
Zhengqi Wen
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition.
IEEE Signal Process. Lett.
29 (2022)
Shuai Zhang
,
Jiangyan Yi
,
Zhengkun Tian
,
Jianhua Tao
,
Yu Ting Yeung
,
Liqun Deng
reducing multilingual context confusion for end-to-end code-switching automatic speech recognition.
INTERSPEECH
(2022)
Jiangyan Yi
,
Ruibo Fu
,
Jianhua Tao
,
Shuai Nie
,
Haoxin Ma
,
Chenglong Wang
,
Tao Wang
,
Zhengkun Tian
,
Ye Bai
,
Cunhang Fan
,
Shan Liang
,
Shiming Wang
,
Shuai Zhang
,
Xinrui Yan
,
Le Xu
,
Zhengqi Wen
,
Haizhou Li
ADD 2022: the first Audio Deep Synthesis Detection Challenge.
ICASSP
(2022)
Jiangyan Yi
,
Ruibo Fu
,
Jianhua Tao
,
Shuai Nie
,
Haoxin Ma
,
Chenglong Wang
,
Tao Wang
,
Zhengkun Tian
,
Ye Bai
,
Cunhang Fan
,
Shan Liang
,
Shiming Wang
,
Shuai Zhang
,
Xinrui Yan
,
Le Xu
,
Zhengqi Wen
,
Haizhou Li
,
Zheng Lian
,
Bin Liu
ADD 2022: the First Audio Deep Synthesis Detection Challenge.
CoRR
(2022)
Ye Bai
,
Jiangyan Yi
,
Jianhua Tao
,
Zhengkun Tian
,
Zhengqi Wen
,
Shuai Zhang
Fast End-to-End Speech Recognition via a Non-Autoregressive Model and Cross-Modal Knowledge Transferring from BERT.
CoRR
(2021)
Ye Bai
,
Jiangyan Yi
,
Jianhua Tao
,
Zhengqi Wen
,
Zhengkun Tian
,
Shuai Zhang
Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Zhengkun Tian
,
Jiangyan Yi
,
Ye Bai
,
Jianhua Tao
,
Shuai Zhang
,
Zhengqi Wen
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization.
Interspeech
(2021)
Zhengkun Tian
,
Jiangyan Yi
,
Jianhua Tao
,
Ye Bai
,
Shuai Zhang
,
Zhengqi Wen
,
Xuefei Liu
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech Recognition.
CoRR
(2021)
Zhengkun Tian
,
Jiangyan Yi
,
Ye Bai
,
Jianhua Tao
,
Shuai Zhang
,
Zhengqi Wen
One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition.
APSIPA ASC
(2021)
Ye Bai
,
Jiangyan Yi
,
Jianhua Tao
,
Zhengkun Tian
,
Zhengqi Wen
,
Shuai Zhang
Fast End-to-End Speech Recognition Via Non-Autoregressive Models and Cross-Modal Knowledge Transferring From BERT.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Zhengkun Tian
,
Jiangyan Yi
,
Ye Bai
,
Jianhua Tao
,
Shuai Zhang
,
Zhengqi Wen
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization.
CoRR
(2021)
Shuai Zhang
,
Jiangyan Yi
,
Zhengkun Tian
,
Ye Bai
,
Jianhua Tao
,
Xuefei Liu
,
Zhengqi Wen
End-to-End Spelling Correction Conditioned on Acoustic Feature for Code-Switching Speech Recognition.
Interspeech
(2021)
Shuai Zhang
,
Jiangyan Yi
,
Zhengkun Tian
,
Ye Bai
,
Jianhua Tao
,
Zhengqi Wen
Decoupling Pronunciation and Language for End-to-End Code-Switching Automatic Speech Recognition.
ICASSP
(2021)
Shuai Zhang
,
Jiangyan Yi
,
Zhengkun Tian
,
Jianhua Tao
,
Ye Bai
Rnn-transducer With Language Bias For End-to-end Mandarin-English Code-switching Speech Recognition.
ISCSLP
(2021)
Ye Bai
,
Jiangyan Yi
,
Jianhua Tao
,
Zhengkun Tian
,
Zhengqi Wen
,
Shuai Zhang
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition.
INTERSPEECH
(2020)
Zhengkun Tian
,
Jiangyan Yi
,
Jianhua Tao
,
Ye Bai
,
Shuai Zhang
,
Zhengqi Wen
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition.
CoRR
(2020)
Shuai Zhang
,
Jiangyan Yi
,
Zhengkun Tian
,
Jianhua Tao
,
Ye Bai
Rnn-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition.
CoRR
(2020)
Zhengkun Tian
,
Jiangyan Yi
,
Ye Bai
,
Jianhua Tao
,
Shuai Zhang
,
Zhengqi Wen
Synchronous Transformers for end-to-end Speech Recognition.
ICASSP
(2020)
Shuai Zhang
,
Jiangyan Yi
,
Zhengkun Tian
,
Ye Bai
,
Jianhua Tao
,
Zhengqi Wen
Decoupling Pronunciation and Language for End-to-end Code-switching Automatic Speech Recognition.
CoRR
(2020)
Zhengkun Tian
,
Jiangyan Yi
,
Jianhua Tao
,
Ye Bai
,
Shuai Zhang
,
Zhengqi Wen
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition.
INTERSPEECH
(2020)
Ye Bai
,
Jiangyan Yi
,
Jianhua Tao
,
Zhengkun Tian
,
Zhengqi Wen
,
Shuai Zhang
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition.
CoRR
(2020)
Ye Bai
,
Jiangyan Yi
,
Jianhua Tao
,
Zhengkun Tian
,
Zhengqi Wen
,
Shuai Zhang
Integrating Whole Context to Sequence-to-sequence Speech Recognition.
CoRR
(2019)
Zhengkun Tian
,
Jiangyan Yi
,
Ye Bai
,
Jianhua Tao
,
Shuai Zhang
,
Zhengqi Wen
Synchronous Transformers for End-to-End Speech Recognition.
CoRR
(2019)