​
Login / Signup
Zhenhui Ye
ORCID
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 38
Top Topics
Recurrent Networks
Reinforcement Learning
Diffusion Models
Speech Synthesis
Top Venues
CoRR
ACL (Findings)
ACL (1)
ICLR
</>
Publications
</>
Zhenhui Ye
,
Tianyun Zhong
,
Yi Ren
,
Jiaqi Yang
,
Weichuang Li
,
Jiawei Huang
,
Ziyue Jiang
,
Jinzheng He
,
Rongjie Huang
,
Jinglin Liu
,
Chen Zhang
,
Xiang Yin
,
Zejun Ma
,
Zhou Zhao
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis.
ICLR
(2024)
Rongjie Huang
,
Chunlei Zhang
,
Yongqi Wang
,
Dongchao Yang
,
Jinchuan Tian
,
Zhenhui Ye
,
Luping Liu
,
Zehan Wang
,
Ziyue Jiang
,
Xuankai Chang
,
Jiatong Shi
,
Chao Weng
,
Zhou Zhao
,
Dong Yu
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners.
ACL (1)
(2024)
Zehan Wang
,
Ziang Zhang
,
Xize Cheng
,
Rongjie Huang
,
Luping Liu
,
Zhenhui Ye
,
Haifeng Huang
,
Yang Zhao
,
Tao Jin
,
Peng Gao
,
Zhou Zhao
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion.
CoRR
(2024)
Zhenhui Ye
,
Tianyun Zhong
,
Yi Ren
,
Jiaqi Yang
,
Weichuang Li
,
Jiawei Huang
,
Ziyue Jiang
,
Jinzheng He
,
Rongjie Huang
,
Jinglin Liu
,
Chen Zhang
,
Xiang Yin
,
Zejun Ma
,
Zhou Zhao
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis.
CoRR
(2024)
Ziyue Jiang
,
Jinglin Liu
,
Yi Ren
,
Jinzheng He
,
Zhenhui Ye
,
Shengpeng Ji
,
Qian Yang
,
Chen Zhang
,
Pengfei Wei
,
Chunfeng Wang
,
Xiang Yin
,
Zejun Ma
,
Zhou Zhao
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis.
ICLR
(2024)
Rongjie Huang
,
Mingze Li
,
Dongchao Yang
,
Jiatong Shi
,
Xuankai Chang
,
Zhenhui Ye
,
Yuning Wu
,
Zhiqing Hong
,
Jiawei Huang
,
Jinglin Liu
,
Yi Ren
,
Yuexian Zou
,
Zhou Zhao
,
Shinji Watanabe
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head.
AAAI
(2024)
Rongjie Huang
,
Huadai Liu
,
Xize Cheng
,
Yi Ren
,
Linjun Li
,
Zhenhui Ye
,
Jinzheng He
,
Lichao Zhang
,
Jinglin Liu
,
Xiang Yin
,
Zhou Zhao
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation.
ACL (1)
(2023)
Rongjie Huang
,
Jiawei Huang
,
Dongchao Yang
,
Yi Ren
,
Luping Liu
,
Mingze Li
,
Zhenhui Ye
,
Jinglin Liu
,
Xiang Yin
,
Zhou Zhao
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models.
CoRR
(2023)
Jinzheng He
,
Jinglin Liu
,
Zhenhui Ye
,
Rongjie Huang
,
Chenye Cui
,
Huadai Liu
,
Zhou Zhao
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis.
CoRR
(2023)
Zhenhui Ye
,
Jinzheng He
,
Ziyue Jiang
,
Rongjie Huang
,
Jiawei Huang
,
Jinglin Liu
,
Yi Ren
,
Xiang Yin
,
Zejun Ma
,
Zhou Zhao
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation.
CoRR
(2023)
Jiawei Huang
,
Yi Ren
,
Rongjie Huang
,
Dongchao Yang
,
Zhenhui Ye
,
Chen Zhang
,
Jinglin Liu
,
Xiang Yin
,
Zejun Ma
,
Zhou Zhao
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation.
CoRR
(2023)
Rongjie Huang
,
Huadai Liu
,
Xize Cheng
,
Yi Ren
,
Linjun Li
,
Zhenhui Ye
,
Jinzheng He
,
Lichao Zhang
,
Jinglin Liu
,
Xiang Yin
,
Zhou Zhao
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation.
CoRR
(2023)
Rongjie Huang
,
Mingze Li
,
Dongchao Yang
,
Jiatong Shi
,
Xuankai Chang
,
Zhenhui Ye
,
Yuning Wu
,
Zhiqing Hong
,
Jiawei Huang
,
Jinglin Liu
,
Yi Ren
,
Zhou Zhao
,
Shinji Watanabe
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head.
CoRR
(2023)
Yixiang Ren
,
Zhenhui Ye
,
Yining Chen
,
Xiaohong Jiang
,
Guanghua Song
Erratum to: Soft-HGRNs: soft hierarchical graph recurrent networks for multi-agent partially observable environments.
Frontiers Inf. Technol. Electron. Eng.
24 (3) (2023)
Yixiang Ren
,
Zhenhui Ye
,
Yining Chen
,
Xiaohong Jiang
,
Guanghua Song
Soft-HGRNs: soft hierarchical graph recurrent networks for multi-agent partially observable environments.
Frontiers Inf. Technol. Electron. Eng.
24 (1) (2023)
Zhenhui Ye
,
Ke Wang
,
Yining Chen
,
Xiaohong Jiang
,
Guanghua Song
Multi-UAV Navigation for Partially Observable Communication Coverage by Graph Reinforcement Learning.
IEEE Trans. Mob. Comput.
22 (7) (2023)
Zhenhui Ye
,
Ziyue Jiang
,
Yi Ren
,
Jinglin Liu
,
Jinzheng He
,
Zhou Zhao
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis.
ICLR
(2023)
Rongjie Huang
,
Chunlei Zhang
,
Yongqi Wang
,
Dongchao Yang
,
Luping Liu
,
Zhenhui Ye
,
Ziyue Jiang
,
Chao Weng
,
Zhou Zhao
,
Dong Yu
Make-A-Voice: Unified Voice Synthesis With Discrete Representation.
CoRR
(2023)
Zhenhui Ye
,
Ziyue Jiang
,
Yi Ren
,
Jinglin Liu
,
Chen Zhang
,
Xiang Yin
,
Zejun Ma
,
Zhou Zhao
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis.
CoRR
(2023)
Zhenhui Ye
,
Ziyue Jiang
,
Yi Ren
,
Jinglin Liu
,
Jinzheng He
,
Zhou Zhao
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis.
CoRR
(2023)
Ziyue Jiang
,
Jinglin Liu
,
Yi Ren
,
Jinzheng He
,
Chen Zhang
,
Zhenhui Ye
,
Pengfei Wei
,
Chunfeng Wang
,
Xiang Yin
,
Zejun Ma
,
Zhou Zhao
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts.
CoRR
(2023)
Zhenhui Ye
,
Rongjie Huang
,
Yi Ren
,
Ziyue Jiang
,
Jinglin Liu
,
Jinzheng He
,
Xiang Yin
,
Zhou Zhao
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training.
ACL (1)
(2023)
Ziyue Jiang
,
Qian Yang
,
Jialong Zuo
,
Zhenhui Ye
,
Rongjie Huang
,
Yi Ren
,
Zhou Zhao
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models.
CoRR
(2023)
Jinglin Liu
,
Zhenhui Ye
,
Qian Chen
,
Siqi Zheng
,
Wen Wang
,
Qinglin Zhang
,
Zhou Zhao
DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect.
ACL (Findings)
(2023)
Jinzheng He
,
Jinglin Liu
,
Zhenhui Ye
,
Rongjie Huang
,
Chenye Cui
,
Huadai Liu
,
Zhou Zhao
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis.
ACL (Findings)
(2023)
Zhenhui Ye
,
Rongjie Huang
,
Yi Ren
,
Ziyue Jiang
,
Jinglin Liu
,
Jinzheng He
,
Xiang Yin
,
Zhou Zhao
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training.
CoRR
(2023)
Rongjie Huang
,
Jiawei Huang
,
Dongchao Yang
,
Yi Ren
,
Luping Liu
,
Mingze Li
,
Zhenhui Ye
,
Jinglin Liu
,
Xiang Yin
,
Zhou Zhao
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models.
ICML
(2023)
Ziyue Jiang
,
Yi Ren
,
Zhenhui Ye
,
Jinglin Liu
,
Chen Zhang
,
Qian Yang
,
Shengpeng Ji
,
Rongjie Huang
,
Chunfeng Wang
,
Xiang Yin
,
Zejun Ma
,
Zhou Zhao
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias.
CoRR
(2023)
Ziyue Jiang
,
Qian Yang
,
Jialong Zuo
,
Zhenhui Ye
,
Rongjie Huang
,
Yi Ren
,
Zhou Zhao
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models.
ACL (Findings)
(2023)
Ziyue Jiang
,
Su Zhe
,
Zhou Zhao
,
Qian Yang
,
Yi Ren
,
Jinglin Liu
,
Zhenhui Ye
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech.
CoRR
(2022)
Daner Hu
,
Zhenhui Ye
,
Yuanqi Gao
,
Zuzhao Ye
,
Yonggang Peng
,
Nanpeng Yu
Multi-Agent Deep Reinforcement Learning for Voltage Control With Coordinated Active and Reactive Power Optimization.
IEEE Trans. Smart Grid
13 (6) (2022)
Zhenhui Ye
,
Yining Chen
,
Xiaohong Jiang
,
Guanghua Song
,
Bowei Yang
,
Sheng Fan
Improving sample efficiency in Multi-Agent Actor-Critic methods.
Appl. Intell.
52 (4) (2022)
Yining Chen
,
Guanghua Song
,
Zhenhui Ye
,
Xiaohong Jiang
Scalable and Transferable Reinforcement Learning for Multi-Agent Mixed Cooperative-Competitive Environments Based on Hierarchical Graph Attention.
Entropy
24 (4) (2022)
Zhenhui Ye
,
Zhou Zhao
,
Yi Ren
,
Fei Wu
SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech.
CoRR
(2022)
Yixiang Ren
,
Zhenhui Ye
,
Guanghua Song
,
Xiaohong Jiang
Space-Air-Ground Integrated Mobile Crowdsensing for Partially Observable Data Collection by Multi-Scale Convolutional Graph Reinforcement Learning.
Entropy
24 (5) (2022)
Zhenhui Ye
,
Zhou Zhao
,
Yi Ren
,
Fei Wu
SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech.
IJCAI
(2022)
Zhenhui Ye
,
Xiaohong Jiang
,
Guanghua Song
,
Bowei Yang
Soft Hierarchical Graph Recurrent Networks for Many-Agent Partially Observable Environments.
CoRR
(2021)
Zhenhui Ye
,
Yining Chen
,
Guanghua Song
,
Bowei Yang
,
Sheng Fan
Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning.
CoRR
(2020)