​
Login / Signup
Jialong Zuo
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 14
Top Topics
Fine Granularity
Diffusion Models
Context Aware Mobile
Speech Synthesis
Top Venues
CoRR
ACL (1)
ACL (Findings)
ICASSP
</>
Publications
</>
Shengpeng Ji
,
Jialong Zuo
,
Minghui Fang
,
Siqi Zheng
,
Qian Chen
,
Wen Wang
,
Ziyue Jiang
,
Hai Huang
,
Xize Cheng
,
Rongjie Huang
,
Zhou Zhao
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec.
CoRR
(2024)
Huaxin Zhang
,
Xiaohao Xu
,
Xiang Wang
,
Jialong Zuo
,
Chuchu Han
,
Xiaonan Huang
,
Changxin Gao
,
Yuehuan Wang
,
Nong Sang
Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM.
CoRR
(2024)
Shengpeng Ji
,
Ziyue Jiang
,
Hanting Wang
,
Jialong Zuo
,
Zhou Zhao
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech.
CoRR
(2024)
Qian Yang
,
Jialong Zuo
,
Zhe Su
,
Ziyue Jiang
,
Mingze Li
,
Zhou Zhao
,
Feiyang Chen
,
Zhefeng Wang
,
Baoxing Huai
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis.
CoRR
(2024)
Jiahao Hong
,
Jialong Zuo
,
Chuchu Han
,
Ruochen Zheng
,
Ming Tian
,
Changxin Gao
,
Nong Sang
Spatial Cascaded Clustering and Weighted Memory for Unsupervised Person Re-identification.
CoRR
(2024)
Shengpeng Ji
,
Minghui Fang
,
Ziyue Jiang
,
Rongjie Huang
,
Jialong Zuo
,
Shulei Wang
,
Zhou Zhao
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models.
CoRR
(2024)
Shengpeng Ji
,
Jialong Zuo
,
Minghui Fang
,
Ziyue Jiang
,
Feiyang Chen
,
Xinyu Duan
,
Baoxing Huai
,
Zhou Zhao
TextrolSpeech: A Text Style Control Speech Corpus with Codec Language Text-to-Speech Models.
ICASSP
(2024)
Shengpeng Ji
,
Ziyue Jiang
,
Hanting Wang
,
Jialong Zuo
,
Zhou Zhao
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech.
ACL (1)
(2024)
Minghui Fang
,
Shengpeng Ji
,
Jialong Zuo
,
Hai Huang
,
Yan Xia
,
Jieming Zhu
,
Xize Cheng
,
Xiaoda Yang
,
Wenrui Liu
,
Gang Wang
,
Zhenhua Dong
,
Zhou Zhao
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling.
CoRR
(2024)
Jialong Zuo
,
Hanyu Zhou
,
Ying Nie
,
Feng Zhang
,
Tianyu Guo
,
Nong Sang
,
Yunhe Wang
,
Changxin Gao
UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity.
CoRR
(2023)
Shengpeng Ji
,
Jialong Zuo
,
Minghui Fang
,
Ziyue Jiang
,
Feiyang Chen
,
Xinyu Duan
,
Baoxing Huai
,
Zhou Zhao
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models.
CoRR
(2023)
Jialong Zuo
,
Changqian Yu
,
Nong Sang
,
Changxin Gao
PLIP: Language-Image Pre-training for Person Representation Learning.
CoRR
(2023)
Ziyue Jiang
,
Qian Yang
,
Jialong Zuo
,
Zhenhui Ye
,
Rongjie Huang
,
Yi Ren
,
Zhou Zhao
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models.
CoRR
(2023)
Ziyue Jiang
,
Qian Yang
,
Jialong Zuo
,
Zhenhui Ye
,
Rongjie Huang
,
Yi Ren
,
Zhou Zhao
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models.
ACL (Findings)
(2023)