Jian Wu
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 23
Top Topics
Speech Recognition
Speaker Diarization
Language Model
Selective Attention
Top Venues
CoRR
ICASSP
Interspeech
Publications
Jian Wu, Naoyuki Kanda, Takuya Yoshioka, Rui Zhao, Zhuo Chen, Jinyu Li
t-SOT FNT: Streaming Multi-Talker ASR with Text-Only Domain Adaptation Capability.
ICASSP
(2024)
Jing Pan, Jian Wu, Yashesh Gaur, Sunit Sivasankaran, Zhuo Chen, Shujie Liu, Jinyu Li
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning.
CoRR
(2023)
Jian Wu, Yashesh Gaur, Zhuo Chen, Long Zhou, Yimeng Zhu, Tianrui Wang, Jinyu Li, Shujie Liu, Bo Ren, Linquan Liu, Yu Wu
On Decoder-Only Architecture For Speech-to-Text and Large Language Model Integration.
ASRU
(2023)
Dongmei Wang, Xiong Xiao, Naoyuki Kanda, Takuya Yoshioka, Jian Wu
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-To-End Neural Diarization.
ICASSP
(2023)
Jian Wu, Naoyuki Kanda, Takuya Yoshioka, Rui Zhao, Zhuo Chen, Jinyu Li
t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability.
CoRR
(2023)
Muqiao Yang, Naoyuki Kanda, Xiaofei Wang, Jian Wu, Sunit Sivasankaran, Zhuo Chen, Jinyu Li, Takuya Yoshioka
Simulating Realistic Speech Overlaps Improves Multi-Talker ASR.
ICASSP
(2023)
Zhuo Chen, Naoyuki Kanda, Jian Wu, Yu Wu, Xiaofei Wang, Takuya Yoshioka, Jinyu Li, Sunit Sivasankaran, Sefik Emre Eskimez
Speech Separation with Large-Scale Self-Supervised Learning.
ICASSP
(2023)
Mohammad Soleymanpour, Mahmoud Al Ismail, Fahimeh Bahmaninezhad, Kshitiz Kumar, Jian Wu
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss.
CoRR
(2023)
Naoyuki Kanda, Jian Wu, Xiaofei Wang, Zhuo Chen, Jinyu Li, Takuya Yoshioka
VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition.
ICASSP
(2023)
Muqiao Yang, Naoyuki Kanda, Xiaofei Wang, Jian Wu, Sunit Sivasankaran, Zhuo Chen, Jinyu Li, Takuya Yoshioka
Simulating realistic speech overlaps improves multi-talker ASR.
CoRR
(2022)
Mostafa Karimi, Changliang Liu, Ken'ichi Kumatani, Yao Qian, Tianyu Wu, Jian Wu
Deploying self-supervised learning in the wild for hybrid automatic speech recognition.
CoRR
(2022)
Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Takuya Yoshioka, Shujie Liu, Jinyu Li, Xiangzhan Yu
Ultra Fast Speech Separation Model with Teacher Student Learning.
CoRR
(2022)
Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Zhuo Chen, Peidong Wang, Gang Liu, Jinyu Li, Jian Wu, Xiangzhan Yu, Furu Wei
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?
CoRR
(2022)
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings.
Interspeech
(2022)
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings.
CoRR
(2022)
Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li
Continuous Speech Separation with Recurrent Selective Attention Network.
ICASSP
(2022)
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Xiangzhan Yu, Furu Wei
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing.
IEEE J. Sel. Top. Signal Process.
16 (6) (2022)
Naoyuki Kanda, Jian Wu, Xiaofei Wang, Zhuo Chen, Jinyu Li, Takuya Yoshioka
VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition.
CoRR
(2022)
Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu
UniSpeech-SAT: Universal Speech Representation Learning With Speaker Aware Pre-Training.
ICASSP
(2022)
Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Zhuo Chen, Peidong Wang, Gang Liu, Jinyu Li, Jian Wu, Xiangzhan Yu, Furu Wei
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?
Interspeech
(2022)
Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li
Investigation of Practical Aspects of Single Channel Speech Separation for ASR.
Interspeech
(2021)
Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Takuya Yoshioka, Shujie Liu, Jinyu Li, Xiangzhan Yu
Ultra Fast Speech Separation Model with Teacher Student Learning.
Interspeech
(2021)
Yihui Fu, Luyao Cheng, Shubo Lv, Yukai Jv, Yuxiang Kong, Zhuo Chen, Yanxin Hu, Lei Xie, Jian Wu, Hui Bu, Xin Xu, Jun Du, Jingdong Chen
AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario.
Interspeech
(2021)