Qiushi Zhu
Publication Activity (10 Years)
Years Active: 2015-2024
Publications (10 Years): 19
Top Topics
Vector Quantization
Speech Recognition
Language Model
Cross Modal
Top Venues
CoRR
ICASSP
ACL (Findings)
APSIPA ASC
Publications
Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Xiao-Ying Zhao, Qiushi Zhu, Yuchen Hu
An Experimental Comparison of Noise-Robust Text-To-Speech Synthesis Systems Based On Self-Supervised Representation.
ICASSP
(2024)
Yu Gu, Qiushi Zhu, Guangzhi Lei, Chao Weng, Dan Su
DurIAN-E 2: Duration Informed Attention Network with Adaptive Variational Autoencoder and Adversarial Learning for Expressive Text-to-Speech Synthesis.
ICASSP
(2024)
Qiushi Zhu, Jie Zhang, Yu Gu, Yuchen Hu, Lirong Dai
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation.
AAAI
(2024)
Qiushi Zhu, Jie Zhang, Yu Gu, Yuchen Hu, Lirong Dai
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation.
CoRR
(2024)
Qiushi Zhu, Long Zhou, Ziqiang Zhang, Shujie Liu, Binxing Jiao, Jie Zhang, Li-Rong Dai, Daxin Jiang, Jinyu Li, Furu Wei
VatLM: Visual-Audio-Text Pre-Training With Unified Masked Prediction for Speech Representation Learning.
IEEE Trans. Multim.
26 (2024)
Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng, Ruizhe Li
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models.
CoRR
(2024)
Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng, Ruizhe Li
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models.
ACL (Findings)
(2024)
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng
Noise-aware Speech Enhancement using Diffusion Probabilistic Model.
CoRR
(2023)
Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.
IJCAI
(2023)
Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.
CoRR
(2023)
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition.
CoRR
(2023)
Xiao-Ying Zhao, Qiushi Zhu, Jie Zhang, Yeping Zhou, Peiqi Liu
Speech Enhancement with Multi-granularity Vector Quantization.
APSIPA ASC
(2023)
Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition.
CoRR
(2023)
Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR.
CoRR
(2023)
Qiushi Zhu, Yu Gu, Chao Weng, Yuchen Hu, Lirong Dai, Jie Zhang
Rep2wav: Noise Robust text-to-speech Using self-supervised representations.
CoRR
(2023)
Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition.
ICASSP
(2023)
Hao Peng, Qiushi Zhu
Approximate evaluation of average downtime under an integrated approach of opportunistic maintenance for multi-component systems.
Comput. Ind. Eng.
109 (2017)
Qiushi Zhu, Hao Peng, Geert-Jan van Houtum
A condition-based maintenance policy for multi-component systems with a high maintenance setup cost.
OR Spectr.
37 (4) (2015)