Login / Signup
Benlai Tang
Publication Activity (10 Years)
Years Active: 2016-2024
Publications (10 Years): 19
Top Topics
Wyner Ziv
Phoneme Recognition
Heterogeneous Sources
Predictive Coding
Top Venues
CoRR
ICASSP
ICME
ISCSLP
</>
Publications
</>
Quanxiu Wang
,
Hui Huang
,
Mingjie Wang
,
Yong Dai
,
Jinzuomu Zhong
,
Benlai Tang
Prior-agnostic Multi-scale Contrastive Text-Audio Pre-training for Parallelized TTS Frontend Modeling.
CoRR
(2024)
Jingning Xu
,
Benlai Tang
,
Mingjie Wang
,
Minghao Li
,
Meirong Ma
CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation.
CoRR
(2023)
Jinzuomu Zhong
,
Yang Li
,
Hui Huang
,
Jie Liu
,
Zhiba Su
,
Jing Guo
,
Benlai Tang
,
Fengjie Zhu
Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWP.
CoRR
(2023)
Jingning Xu
,
Benlai Tang
,
Mingjie Wang
,
Minghao Li
,
Meirong Ma
CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation.
ICME
(2023)
Jie Liu
,
Zhiba Su
,
Hui Huang
,
CaiYan Wan
,
Quanxiu Wang
,
Jiangli Hong
,
Benlai Tang
,
Fengjie Zhu
TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection.
CoRR
(2023)
Jie Liu
,
Zhiba Su
,
Hui Huang
,
Caiyan Wan
,
Quanxiu Wang
,
Jiangli Hong
,
Benlai Tang
,
Fengjie Zhu
TranssionADD: A Multi-frame Reinforcement Based Sequence Tagging Model for Audio Deepfake Detection.
DADA@IJCAI
(2023)
Jingning Xu
,
Benlai Tang
,
Mingjie Wang
,
Siyuan Bian
,
Wenyi Guo
,
Xiang Yin
,
Zejun Ma
Towards Using Clothes Style Transfer for Scenario-Aware Person Video Generation.
ICASSP
(2022)
Chao Wang
,
Zhonghao Li
,
Benlai Tang
,
Xiang Yin
,
Yuan Wan
,
Yibiao Yu
,
Zejun Ma
Towards high-fidelity singing voice conversion with acoustic reference and contrastive predictive coding.
INTERSPEECH
(2022)
Tianyi Xie
,
Liucheng Liao
,
Cheng Bi
,
Benlai Tang
,
Xiang Yin
,
Jianfei Yang
,
Mingjie Wang
,
Jiali Yao
,
Yang Zhang
,
Zejun Ma
Towards Realistic Visual Dubbing with Heterogeneous Sources.
CoRR
(2022)
Jingning Xu
,
Benlai Tang
,
Mingjie Wang
,
Siyuan Bian
,
Wenyi Guo
,
Xiang Yin
,
Zejun Ma
Towards Using Clothes Style Transfer for Scenario-aware Person Video Generation.
CoRR
(2021)
Tianyi Xie
,
Liucheng Liao
,
Cheng Bi
,
Benlai Tang
,
Xiang Yin
,
Jianfei Yang
,
Mingjie Wang
,
Jiali Yao
,
Yang Zhang
,
Zejun Ma
Towards Realistic Visual Dubbing with Heterogeneous Sources.
ACM Multimedia
(2021)
Yu Gu
,
Xiang Yin
,
Yonghui Rao
,
Yuan Wan
,
Benlai Tang
,
Yang Zhang
,
Jitong Chen
,
Yuxuan Wang
,
Zejun Ma
ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders.
ISCSLP
(2021)
Zhonghao Li
,
Benlai Tang
,
Xiang Yin
,
Yuan Wan
,
Ling Xu
,
Chen Shen
,
Zejun Ma
PPG-Based Singing Voice Conversion with Adversarial Representation Learning.
ICASSP
(2021)
Chao Wang
,
Zhonghao Li
,
Benlai Tang
,
Xiang Yin
,
Yuan Wan
,
Yibiao Yu
,
Zejun Ma
Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding.
CoRR
(2021)
Yu Gu
,
Xiang Yin
,
Yonghui Rao
,
Yuan Wan
,
Benlai Tang
,
Yang Zhang
,
Jitong Chen
,
Yuxuan Wang
,
Zejun Ma
ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders.
CoRR
(2020)
Wenjie Li
,
Benlai Tang
,
Xiang Yin
,
Yushi Zhao
,
Wei Li
,
Kang Wang
,
Hao Huang
,
Yuxuan Wang
,
Zejun Ma
Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech.
CoRR
(2020)
Zhonghao Li
,
Benlai Tang
,
Xiang Yin
,
Yuan Wan
,
Ling Xu
,
Chen Shen
,
Zejun Ma
PPG-based singing voice conversion with adversarial representation learning.
CoRR
(2020)
Qiao Tian
,
Bing Yang
,
Jing Chen
,
Benlai Tang
,
Shan Liu
Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder.
CoRR
(2018)
Bo Zhang
,
Yuqin Gan
,
Yan Song
,
Benlai Tang
Application of pronunciation knowledge on phoneme recognition by LSTM neural network.
ICPR
(2016)