Huadai Liu
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 16
Top Topics
Diffusion Model
Word Processing
Audio Visual
Visual Speech
Top Venues
CoRR
ACL (Findings)
EMNLP
ACL (1)
Publications
Huadai Liu, Jialei Wang, Rongjie Huang, Yang Liu, Jiayang Xu, Zhou Zhao
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control.
CoRR (2024)
Huadai Liu, Wenqiang Xu, Xuan Lin, Jingjing Huo, Hong Chen, Zhou Zhao
AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments.
LREC/COLING (2024)
Huadai Liu, Rongjie Huang, Yang Liu, Hengyuan Cao, Jialei Wang, Xize Cheng, Siqi Zheng, Zhou Zhao
AudioLCM: Text-to-Audio Generation with Latent Consistency Models.
CoRR (2024)
Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing.
ACL (Findings) (2024)
Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer.
EMNLP (2023)
Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation.
ACL (1) (2023)
Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis.
CoRR (2023)
Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation.
CoRR (2023)
Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer.
CoRR (2023)
Xize Cheng, Tao Jin, Rongjie Huang, Linjun Li, Wang Lin, Zehan Wang, Ye Wang, Huadai Liu, Aoxiong Yin, Zhou Zhao
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition.
ICCV (2023)
Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing.
CoRR (2023)
Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis.
ACL (Findings) (2023)
Rongjie Huang, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He, Zhou Zhao
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation.
ICLR (2023)
Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech.
CoRR (2022)
Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren
ProDiff: Progressive Fast Diffusion Model for High-Quality Text-to-Speech.
ACM Multimedia (2022)
Rongjie Huang, Zhou Zhao, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation.
CoRR (2022)