​
Login / Signup
Yu Wu
ORCID
Publication Activity (10 Years)
Years Active: 2017-2024
Publications (10 Years): 77
Top Topics
Speech Recognition
Labeled And Unlabeled Data
Language Model
Spoken Language
Top Venues
CoRR
INTERSPEECH
ICASSP
Interspeech
</>
Publications
</>
Xun Gong
,
Yu Wu
,
Jinyu Li
,
Shujie Liu
,
Rui Zhao
,
Xie Chen
,
Yanmin Qian
Advanced Long-Content Speech Recognition With Factorized Neural Transducer.
CoRR
(2024)
Tianrui Wang
,
Long Zhou
,
Ziqiang Zhang
,
Yu Wu
,
Shujie Liu
,
Yashesh Gaur
,
Zhuo Chen
,
Jinyu Li
,
Furu Wei
VioLA: Conditional Language Models for Speech Recognition, Synthesis, and Translation.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Ziqiang Zhang
,
Sanyuan Chen
,
Long Zhou
,
Yu Wu
,
Shuo Ren
,
Shujie Liu
,
Zhuoyuan Yao
,
Xun Gong
,
Li-Rong Dai
,
Jinyu Li
,
Furu Wei
SpeechLM: Enhanced Speech Pre-Training With Unpaired Textual Data.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Xun Gong
,
Yu Wu
,
Jinyu Li
,
Shujie Liu
,
Rui Zhao
,
Xie Chen
,
Yanmin Qian
Advanced Long-Content Speech Recognition With Factorized Neural Transducer.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Xun Gong
,
Yu Wu
,
Jinyu Li
,
Shujie Liu
,
Rui Zhao
,
Xie Chen
,
Yanmin Qian
LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer.
ICASSP
(2023)
Peidong Wang
,
Eric Sun
,
Jian Xue
,
Yu Wu
,
Long Zhou
,
Yashesh Gaur
,
Shujie Liu
,
Jinyu Li
LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers.
INTERSPEECH
(2023)
Tianrui Wang
,
Long Zhou
,
Ziqiang Zhang
,
Yu Wu
,
Shujie Liu
,
Yashesh Gaur
,
Zhuo Chen
,
Jinyu Li
,
Furu Wei
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation.
CoRR
(2023)
Yuang Li
,
Yu Wu
,
Jinyu Li
,
Shujie Liu
Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition.
ASRU
(2023)
Jian Wu
,
Yashesh Gaur
,
Zhuo Chen
,
Long Zhou
,
Yimeng Zhu
,
Tianrui Wang
,
Jinyu Li
,
Shujie Liu
,
Bo Ren
,
Linquan Liu
,
Yu Wu
On decoder-only architecture for speech-to-text and large language model integration.
CoRR
(2023)
Yuang Li
,
Yu Wu
,
Jinyu Li
,
Shujie Liu
Accelerating Transducers through Adjacent Token Merging.
INTERSPEECH
(2023)
Youngdo Ahn
,
Chengyi Wang
,
Yu Wu
,
Jong Won Shin
,
Shujie Liu
GRAVO: Learning to Generate Relevant Audio from Visual Features with Noisy Online Videos.
INTERSPEECH
(2023)
Guangyu Chen
,
Yu Wu
,
Shujie Liu
,
Tao Liu
,
Xiaoyong Du
,
Furu Wei
WavMark: Watermarking for Audio Generation.
CoRR
(2023)
Zhengyang Chen
,
Sanyuan Chen
,
Yu Wu
,
Yao Qian
,
Chengyi Wang
,
Shujie Liu
,
Yanmin Qian
,
Michael Zeng
Large-Scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification.
ICASSP
(2022)
Yutong Chen
,
Ronglai Zuo
,
Fangyun Wei
,
Yu Wu
,
Shujie Liu
,
Brian Mak
Two-Stream Network for Sign Language Recognition and Translation.
NeurIPS
(2022)
Zhuo Chen
,
Naoyuki Kanda
,
Jian Wu
,
Yu Wu
,
Xiaofei Wang
,
Takuya Yoshioka
,
Jinyu Li
,
Sunit Sivasankaran
,
Sefik Emre Eskimez
Speech separation with large-scale self-supervised learning.
CoRR
(2022)
Junyi Ao
,
Rui Wang
,
Long Zhou
,
Chengyi Wang
,
Shuo Ren
,
Yu Wu
,
Shujie Liu
,
Tom Ko
,
Qing Li
,
Yu Zhang
,
Zhihua Wei
,
Yao Qian
,
Jinyu Li
,
Furu Wei
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing.
ACL (1)
(2022)
Sanyuan Chen
,
Yu Wu
,
Zhuo Chen
,
Jian Wu
,
Takuya Yoshioka
,
Shujie Liu
,
Jinyu Li
,
Xiangzhan Yu
Ultra Fast Speech Separation Model with Teacher Student Learning.
CoRR
(2022)
Sanyuan Chen
,
Yu Wu
,
Chengyi Wang
,
Shujie Liu
,
Zhuo Chen
,
Peidong Wang
,
Gang Liu
,
Jinyu Li
,
Jian Wu
,
Xiangzhan Yu
,
Furu Wei
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?
CoRR
(2022)
Naoyuki Kanda
,
Jian Wu
,
Yu Wu
,
Xiong Xiao
,
Zhong Meng
,
Xiaofei Wang
,
Yashesh Gaur
,
Zhuo Chen
,
Jinyu Li
,
Takuya Yoshioka
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings.
INTERSPEECH
(2022)
Chengyi Wang
,
Yiming Wang
,
Yu Wu
,
Sanyuan Chen
,
Jinyu Li
,
Shujie Liu
,
Furu Wei
Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training.
CoRR
(2022)
Naoyuki Kanda
,
Jian Wu
,
Yu Wu
,
Xiong Xiao
,
Zhong Meng
,
Xiaofei Wang
,
Yashesh Gaur
,
Zhuo Chen
,
Jinyu Li
,
Takuya Yoshioka
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings.
CoRR
(2022)
Ziqiang Zhang
,
Sanyuan Chen
,
Long Zhou
,
Yu Wu
,
Shuo Ren
,
Shujie Liu
,
Zhuoyuan Yao
,
Xun Gong
,
Lirong Dai
,
Jinyu Li
,
Furu Wei
SpeechLM: Enhanced Speech Pre-Training with Unpaired Textual Data.
CoRR
(2022)
Sanyuan Chen
,
Chengyi Wang
,
Zhengyang Chen
,
Yu Wu
,
Shujie Liu
,
Zhuo Chen
,
Jinyu Li
,
Naoyuki Kanda
,
Takuya Yoshioka
,
Xiong Xiao
,
Jian Wu
,
Long Zhou
,
Shuo Ren
,
Yanmin Qian
,
Yao Qian
,
Jian Wu
,
Michael Zeng
,
Xiangzhan Yu
,
Furu Wei
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing.
IEEE J. Sel. Top. Signal Process.
16 (6) (2022)
Chengyi Wang
,
Yiming Wang
,
Yu Wu
,
Sanyuan Chen
,
Jinyu Li
,
Shujie Liu
,
Furu Wei
Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training.
INTERSPEECH
(2022)
Sanyuan Chen
,
Yu Wu
,
Chengyi Wang
,
Shujie Liu
,
Daniel Tompkins
,
Zhuo Chen
,
Furu Wei
BEATs: Audio Pre-Training with Acoustic Tokenizers.
CoRR
(2022)
Chengyi Wang
,
Yu Wu
,
Sanyuan Chen
,
Shujie Liu
,
Jinyu Li
,
Yao Qian
,
Zhenglu Yang
Improving Self-Supervised Learning for Speech Recognition with Intermediate Layer Supervision.
ICASSP
(2022)
Hyungchan Song
,
Sanyuan Chen
,
Zhuo Chen
,
Yu Wu
,
Takuya Yoshioka
,
Min Tang
,
Jong Won Shin
,
Shujie Liu
Exploring WavLM on Speech Enhancement.
SLT
(2022)
Sanyuan Chen
,
Yu Wu
,
Chengyi Wang
,
Zhengyang Chen
,
Zhuo Chen
,
Shujie Liu
,
Jian Wu
,
Yao Qian
,
Furu Wei
,
Jinyu Li
,
Xiangzhan Yu
Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training.
ICASSP
(2022)
Shuo Ren
,
Shujie Liu
,
Yu Wu
,
Long Zhou
,
Furu Wei
Speech Pre-training with Acoustic Piece.
INTERSPEECH
(2022)
Yiming Wang
,
Jinyu Li
,
Heming Wang
,
Yao Qian
,
Chengyi Wang
,
Yu Wu
Wav2vec-Switch: Contrastive Learning from Original-Noisy Speech Pairs for Robust Speech Recognition.
ICASSP
(2022)
Sanyuan Chen
,
Yu Wu
,
Chengyi Wang
,
Shujie Liu
,
Zhuo Chen
,
Peidong Wang
,
Gang Liu
,
Jinyu Li
,
Jian Wu
,
Xiangzhan Yu
,
Furu Wei
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?
INTERSPEECH
(2022)
Hyungchan Song
,
Sanyuan Chen
,
Zhuo Chen
,
Yu Wu
,
Takuya Yoshioka
,
Min Tang
,
Jong Won Shin
,
Shujie Liu
Exploring WavLM on Speech Enhancement.
CoRR
(2022)
Zhong Meng
,
Yashesh Gaur
,
Naoyuki Kanda
,
Jinyu Li
,
Xie Chen
,
Yu Wu
,
Yifan Gong
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition.
CoRR
(2021)
Naoyuki Kanda
,
Guoli Ye
,
Yu Wu
,
Yashesh Gaur
,
Xiaofei Wang
,
Zhong Meng
,
Zhuo Chen
,
Takuya Yoshioka
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone.
CoRR
(2021)
Xiong Xiao
,
Naoyuki Kanda
,
Zhuo Chen
,
Tianyan Zhou
,
Takuya Yoshioka
,
Sanyuan Chen
,
Yong Zhao
,
Gang Liu
,
Yu Wu
,
Jian Wu
,
Shujie Liu
,
Jinyu Li
,
Yifan Gong
Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020.
ICASSP
(2021)
Chengyi Wang
,
Yu Wu
,
Yao Qian
,
Ken'ichi Kumatani
,
Shujie Liu
,
Furu Wei
,
Michael Zeng
,
Xuedong Huang
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data.
CoRR
(2021)
Leyang Cui
,
Yu Wu
,
Shujie Liu
,
Yue Zhang
Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation.
EMNLP (1)
(2021)
Leyang Cui
,
Yu Wu
,
Jian Liu
,
Sen Yang
,
Yue Zhang
Template-Based Named Entity Recognition Using BART.
ACL/IJCNLP (Findings)
(2021)
Zhengyang Chen
,
Sanyuan Chen
,
Yu Wu
,
Yao Qian
,
Chengyi Wang
,
Shujie Liu
,
Yanmin Qian
,
Michael Zeng
Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification.
CoRR
(2021)
Leyang Cui
,
Yu Wu
,
Jian Liu
,
Sen Yang
,
Yue Zhang
Template-Based Named Entity Recognition Using BART.
CoRR
(2021)
Zhong Meng
,
Yu Wu
,
Naoyuki Kanda
,
Liang Lu
,
Xie Chen
,
Guoli Ye
,
Eric Sun
,
Jinyu Li
,
Yifan Gong
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition.
CoRR
(2021)
Sanyuan Chen
,
Yu Wu
,
Chengyi Wang
,
Zhengyang Chen
,
Zhuo Chen
,
Shujie Liu
,
Jian Wu
,
Yao Qian
,
Furu Wei
,
Jinyu Li
,
Xiangzhan Yu
UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training.
CoRR
(2021)
Jian Wu
,
Zhuo Chen
,
Sanyuan Chen
,
Yu Wu
,
Takuya Yoshioka
,
Naoyuki Kanda
,
Shujie Liu
,
Jinyu Li
Investigation of Practical Aspects of Single Channel Speech Separation for ASR.
Interspeech
(2021)
Zhong Meng
,
Yu Wu
,
Naoyuki Kanda
,
Liang Lu
,
Xie Chen
,
Guoli Ye
,
Eric Sun
,
Jinyu Li
,
Yifan Gong
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition.
Interspeech
(2021)
Sanyuan Chen
,
Yu Wu
,
Zhuo Chen
,
Jian Wu
,
Takuya Yoshioka
,
Shujie Liu
,
Jinyu Li
,
Xiangzhan Yu
Ultra Fast Speech Separation Model with Teacher Student Learning.
Interspeech
(2021)
Xie Chen
,
Yu Wu
,
Zhenghao Wang
,
Shujie Liu
,
Jinyu Li
Developing Real-Time Streaming Transformer Transducer for Speech Recognition on Large-Scale Dataset.
ICASSP
(2021)
Chengyi Wang
,
Yu Wu
,
Yao Qian
,
Ken'ichi Kumatani
,
Shujie Liu
,
Furu Wei
,
Michael Zeng
,
Xuedong Huang
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data.
ICML
(2021)
Sanyuan Chen
,
Yu Wu
,
Zhuo Chen
,
Takuya Yoshioka
,
Shujie Liu
,
Jin-Yu Li
,
Xiangzhan Yu
Don't Shoot Butterfly with Rifles: Multi-Channel Continuous Speech Separation with Early Exit Transformer.
ICASSP
(2021)
Leyang Cui
,
Sijie Cheng
,
Yu Wu
,
Yue Zhang
On Commonsense Cues in BERT for Solving Commonsense Tasks.
ACL/IJCNLP (Findings)
(2021)
Naoyuki Kanda
,
Guoli Ye
,
Yu Wu
,
Yashesh Gaur
,
Xiaofei Wang
,
Zhong Meng
,
Zhuo Chen
,
Takuya Yoshioka
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone.
Interspeech
(2021)
Leyang Cui
,
Yu Wu
,
Shujie Liu
,
Yue Zhang
Knowledge Enhanced Fine-Tuning for Better Handling Unseen Entities in Dialogue Generation.
CoRR
(2021)
Sanyuan Chen
,
Yu Wu
,
Zhuo Chen
,
Jian Wu
,
Jinyu Li
,
Takuya Yoshioka
,
Chengyi Wang
,
Shujie Liu
,
Ming Zhou
Continuous Speech Separation with Conformer.
ICASSP
(2021)
Jian Wu
,
Zhuo Chen
,
Sanyuan Chen
,
Yu Wu
,
Takuya Yoshioka
,
Naoyuki Kanda
,
Shujie Liu
,
Jinyu Li
Investigation of Practical Aspects of Single Channel Speech Separation for ASR.
CoRR
(2021)
Sanyuan Chen
,
Yu Wu
,
Zhuo Chen
,
Takuya Yoshioka
,
Shujie Liu
,
Jinyu Li
Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer.
CoRR
(2020)
Leyang Cui
,
Sijie Cheng
,
Yu Wu
,
Yue Zhang
Does BERT Solve Commonsense Task via Commonsense Knowledge?
CoRR
(2020)
Jinyu Li
,
Yu Wu
,
Yashesh Gaur
,
Chengyi Wang
,
Rui Zhao
,
Shujie Liu
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition.
CoRR
(2020)
Naihan Li
,
Yanqing Liu
,
Yu Wu
,
Shujie Liu
,
Sheng Zhao
,
Ming Liu
RobuTrans: A Robust Transformer-Based Text-to-Speech Model.
AAAI
(2020)
Xie Chen
,
Yu Wu
,
Zhenghao Wang
,
Shujie Liu
,
Jinyu Li
Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset.
CoRR
(2020)
Leyang Cui
,
Yu Wu
,
Shujie Liu
,
Yue Zhang
,
Ming Zhou
MuTual: A Dataset for Multi-Turn Dialogue Reasoning.
ACL
(2020)
Jinyu Li
,
Yu Wu
,
Yashesh Gaur
,
Chengyi Wang
,
Rui Zhao
,
Shujie Liu
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition.
INTERSPEECH
(2020)
Shuo Ren
,
Yu Wu
,
Shujie Liu
,
Ming Zhou
,
Shuai Ma
A Retrieve-and-Rewrite Initialization Method for Unsupervised Machine Translation.
ACL
(2020)
Chengyi Wang
,
Yu Wu
,
Shujie Liu
,
Ming Zhou
,
Zhenglu Yang
Curriculum Pre-training for End-to-End Speech Translation.
ACL
(2020)
Leyang Cui
,
Yu Wu
,
Shujie Liu
,
Yue Zhang
,
Ming Zhou
MuTual: A Dataset for Multi-Turn Dialogue Reasoning.
CoRR
(2020)
Yu Wu
,
Yunli Wang
,
Shujie Liu
A Dataset for Low-Resource Stylized Sequence-to-Sequence Generation.
AAAI
(2020)
Xiong Xiao
,
Naoyuki Kanda
,
Zhuo Chen
,
Tianyan Zhou
,
Takuya Yoshioka
,
Sanyuan Chen
,
Yong Zhao
,
Gang Liu
,
Yu Wu
,
Jian Wu
,
Shujie Liu
,
Jinyu Li
,
Yifan Gong
Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020.
CoRR
(2020)
Chengyi Wang
,
Yu Wu
,
Shujie Liu
,
Zhenglu Yang
,
Ming Zhou
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation.
AAAI
(2020)
Sanyuan Chen
,
Yu Wu
,
Zhuo Chen
,
Jinyu Li
,
Chengyi Wang
,
Shujie Liu
,
Ming Zhou
Continuous Speech Separation with Conformer.
CoRR
(2020)
Chengyi Wang
,
Yu Wu
,
Shujie Liu
,
Ming Zhou
,
Zhenglu Yang
Curriculum Pre-training for End-to-End Speech Translation.
CoRR
(2020)
Chengyi Wang
,
Yu Wu
,
Yujiao Du
,
Jinyu Li
,
Shujie Liu
,
Liang Lu
,
Shuo Ren
,
Guoli Ye
,
Sheng Zhao
,
Ming Zhou
Semantic Mask for Transformer Based End-to-End Speech Recognition.
INTERSPEECH
(2020)
Chengyi Wang
,
Yu Wu
,
Liang Lu
,
Shujie Liu
,
Jinyu Li
,
Guoli Ye
,
Ming Zhou
Low Latency End-to-End Streaming Speech Recognition with a Scout Network.
INTERSPEECH
(2020)
Chengyi Wang
,
Yu Wu
,
Yujiao Du
,
Jinyu Li
,
Shujie Liu
,
Liang Lu
,
Shuo Ren
,
Guoli Ye
,
Sheng Zhao
,
Ming Zhou
Semantic Mask for Transformer based End-to-End Speech Recognition.
CoRR
(2019)
Shuo Ren
,
Yu Wu
,
Shujie Liu
,
Ming Zhou
,
Shuai Ma
Explicit Cross-lingual Pre-training for Unsupervised Machine Translation.
EMNLP/IJCNLP (1)
(2019)
Shuo Ren
,
Yu Wu
,
Shujie Liu
,
Ming Zhou
,
Shuai Ma
Explicit Cross-lingual Pre-training for Unsupervised Machine Translation.
CoRR
(2019)
Kun Zhou
,
Kai Zhang
,
Yu Wu
,
Shujie Liu
,
Jingsong Yu
Unsupervised Context Rewriting for Open Domain Conversation.
EMNLP/IJCNLP (1)
(2019)
Chengyi Wang
,
Yu Wu
,
Shujie Liu
,
Zhenglu Yang
,
Ming Zhou
Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation.
CoRR
(2019)
Kun Zhou
,
Kai Zhang
,
Yu Wu
,
Shujie Liu
,
Jingsong Yu
Unsupervised Context Rewriting for Open Domain Conversation.
CoRR
(2019)
Dejian Yang
,
Yu Wu
,
Zhoujun Li
,
Wei Wu
,
Can Xu
Beihang at the NTCIR-13 STC-2 Task.
NTCIR
(2017)