Login / Signup
Tao Wang
ORCID
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 22
Top Topics
Diffusion Model
Speech Synthesis
Congestion Control
Audio Visual
Top Venues
CoRR
ICASSP
IEEE ACM Trans. Audio Speech Lang. Process.
DDAM@MM
</>
Publications
</>
Ruibo Fu
,
Xin Qi
,
Zhengqi Wen
,
Jianhua Tao
,
Tao Wang
,
Chunyu Qiang
,
Zhiyong Wang
,
Yi Lu
,
Xiaopeng Wang
,
Shuchen Shi
,
Yukun Liu
,
Xuefei Liu
,
Shuai Zhang
ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation.
CoRR
(2024)
Ruibo Fu
,
Shuchen Shi
,
Hongming Guo
,
Tao Wang
,
Chunyu Qiang
,
Zhengqi Wen
,
Jianhua Tao
,
Xin Qi
,
Yi Lu
,
Xiaopeng Wang
,
Zhiyong Wang
,
Yukun Liu
,
Xuefei Liu
,
Shuai Zhang
,
Guanjun Li
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation.
CoRR
(2024)
Junzuo Zhou
,
Jiangyan Yi
,
Tao Wang
,
Jianhua Tao
,
Ye Bai
,
Chu Yuan Zhang
,
Yong Ren
,
Zhengqi Wen
TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking.
CoRR
(2024)
Yong Ren
,
Tao Wang
,
Jiangyan Yi
,
Le Xu
,
Jianhua Tao
,
Chu Yuan Zhang
,
Junzuo Zhou
Fewer-Token Neural Speech Codec with Time-Invariant Codes.
ICASSP
(2024)
Ruibo Fu
,
Rui Liu
,
Chunyu Qiang
,
Yingming Gao
,
Yi Lu
,
Shuchen Shi
,
Tao Wang
,
Ya Li
,
Zhengqi Wen
,
Chen Zhang
,
Hui Bu
,
Yukun Liu
,
Xin Qi
,
Guanjun Li
ICAGC 2024: Inspirational and Convincing Audio Generation Challenge 2024.
CoRR
(2024)
Chunyu Qiang
,
Hao Li
,
Hao Ni
,
He Qu
,
Ruibo Fu
,
Tao Wang
,
Longbiao Wang
,
Jianwu Dang
Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding.
ICASSP
(2024)
Tao Wang
,
Jiangyan Yi
,
Ruibo Fu
,
Jianhua Tao
,
Zhengqi Wen
,
Chu Yuan Zhang
Emotion selectable end-to-end text-based speech editing.
Artif. Intell.
329 (2024)
Chunyu Qiang
,
Hao Li
,
Yixin Tian
,
Ruibo Fu
,
Tao Wang
,
Longbiao Wang
,
Jianwu Dang
Learning Speech Representation from Contrastive Token-Acoustic Pretraining.
ICASSP
(2024)
Chunyu Qiang
,
Hao Li
,
Yixin Tian
,
Ruibo Fu
,
Tao Wang
,
Longbiao Wang
,
Jianwu Dang
Learning Speech Representation From Contrastive Token-Acoustic Pretraining.
CoRR
(2023)
Chunyu Qiang
,
Hao Li
,
Hao Ni
,
He Qu
,
Ruibo Fu
,
Tao Wang
,
Longbiao Wang
,
Jianwu Dang
Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding.
CoRR
(2023)
Jiangyan Yi
,
Jianhua Tao
,
Ruibo Fu
,
Tao Wang
,
Chu Yuan Zhang
,
Chenglong Wang
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Xinrui Yan
,
Jiangyan Yi
,
Jianhua Tao
,
Chenglong Wang
,
Haoxin Ma
,
Tao Wang
,
Shiming Wang
,
Ruibo Fu
An Initial Investigation for Detecting Vocoder Fingerprints of Fake Audio.
CoRR
(2022)
Tao Wang
,
Ruibo Fu
,
Jiangyan Yi
,
Jianhua Tao
,
Zhengqi Wen
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation.
IEEE ACM Trans. Audio Speech Lang. Process.
30 (2022)
Tao Wang
,
Jiangyan Yi
,
Liqun Deng
,
Ruibo Fu
,
Jianhua Tao
,
Zhengqi Wen
Context-Aware Mask Prediction Network for End-to-End Text-Based Speech Editing.
ICASSP
(2022)
Tao Wang
,
Ruibo Fu
,
Jiangyan Yi
,
Zhengqi Wen
,
Jianhua Tao
Singing-Tacotron: Global Duration Control Attention and Dynamic Filter for End-to-end Singing Voice Synthesis.
DDAM@MM
(2022)
Tao Wang
,
Ruibo Fu
,
Jiangyan Yi
,
Jianhua Tao
,
Zhengqi Wen
NeuralDPS: Neural Deterministic Plus Stochastic Model with Multiband Excitation for Noise-Controllable Waveform Generation.
CoRR
(2022)
Xinrui Yan
,
Jiangyan Yi
,
Jianhua Tao
,
Chenglong Wang
,
Haoxin Ma
,
Tao Wang
,
Shiming Wang
,
Ruibo Fu
An Initial Investigation for Detecting Vocoder Fingerprints of Fake Audio.
DDAM@MM
(2022)
Haoxin Ma
,
Jiangyan Yi
,
Chenglong Wang
,
Xinrui Yan
,
Jianhua Tao
,
Tao Wang
,
Shiming Wang
,
Le Xu
,
Ruibo Fu
FAD: A Chinese Dataset for Fake Audio Detection.
CoRR
(2022)
Chunyu Qiang
,
Jianhua Tao
,
Ruibo Fu
,
Zhengqi Wen
,
Jiangyan Yi
,
Tao Wang
,
Shiming Wang
Text Enhancement for Paragraph Processing in End-to-End Code-switching TTS.
CoRR
(2022)
Jiangyan Yi
,
Ruibo Fu
,
Jianhua Tao
,
Shuai Nie
,
Haoxin Ma
,
Chenglong Wang
,
Tao Wang
,
Zhengkun Tian
,
Ye Bai
,
Cunhang Fan
,
Shan Liang
,
Shiming Wang
,
Shuai Zhang
,
Xinrui Yan
,
Le Xu
,
Zhengqi Wen
,
Haizhou Li
ADD 2022: the first Audio Deep Synthesis Detection Challenge.
ICASSP
(2022)
Tao Wang
,
Jiangyan Yi
,
Ruibo Fu
,
Jianhua Tao
,
Zhengqi Wen
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing.
IEEE ACM Trans. Audio Speech Lang. Process.
30 (2022)
Jiangyan Yi
,
Ye Bai
,
Jianhua Tao
,
Haoxin Ma
,
Zhengkun Tian
,
Chenglong Wang
,
Tao Wang
,
Ruibo Fu
Half-Truth: A Partially Fake Audio Detection Dataset.
Interspeech
(2021)