Disong Wang
Publication Activity (10 Years)
Years Active: 2014-2024
Publications (10 Years): 27
Top Topics
Mutual Information
Gene Expression Profiles
Speech Synthesis
Visual Similarity
Top Venues
CoRR
ICASSP
Interspeech
IEEE ACM Trans. Audio Speech Lang. Process.
Publications
Xueyuan Chen, Yuejiao Wang, Xixin Wu, Disong Wang, Zhiyong Wu, Xunying Liu, Helen Meng
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction.
CoRR
(2024)
Yuejiao Wang, Xixin Wu, Disong Wang, Lingwei Meng, Helen Meng
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization.
CoRR
(2024)
Yuejiao Wang, Xixin Wu, Disong Wang, Lingwei Meng, Helen Meng
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization.
ICASSP
(2024)
Xueyuan Chen, Yuejiao Wang, Xixin Wu, Disong Wang, Zhiyong Wu, Xunying Liu, Helen Meng
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction.
ICASSP
(2024)
Disong Wang, Songxiang Liu, Xixin Wu, Hui Lu, Lifa Sun, Xunying Liu, Helen Meng
Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation.
CoRR
(2022)
Hui Lu, Disong Wang, Xixin Wu, Zhiyong Wu, Xunying Liu, Helen Meng
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE.
CoRR
(2022)
Disong Wang, Shan Yang, Dan Su, Xunying Liu, Dong Yu, Helen Meng
VCVTS: Multi-Speaker Video-to-Speech Synthesis Via Cross-Modal Knowledge Transfer from Voice Conversion.
ICASSP
(2022)
Disong Wang, Shan Yang, Dan Su, Xunying Liu, Dong Yu, Helen Meng
VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion.
CoRR
(2022)
Disong Wang, Songxiang Liu, Xixin Wu, Hui Lu, Lifa Sun, Xunying Liu, Helen Meng
Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation.
ICASSP
(2022)
Hui Lu, Disong Wang, Xixin Wu, Zhiyong Wu, Xunying Liu, Helen Meng
Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE.
SLT
(2022)
Disong Wang, Liqun Deng, Yu Ting Yeung, Xiao Chen, Xunying Liu, Helen Meng
Unsupervised Domain Adaptation for Dysarthric Speech Detection via Domain Adversarial Training and Mutual Information Minimization.
CoRR
(2021)
Songxiang Liu, Yuewen Cao, Disong Wang, Xixin Wu, Xunying Liu, Helen Meng
Any-to-Many Voice Conversion With Location-Relative Sequence-to-Sequence Modeling.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Disong Wang, Liqun Deng, Yu Ting Yeung, Xiao Chen, Xunying Liu, Helen Meng
Unsupervised Domain Adaptation for Dysarthric Speech Detection via Domain Adversarial Training and Mutual Information Minimization.
Interspeech
(2021)
Disong Wang, Jianwei Yu, Xixin Wu, Lifa Sun, Xunying Liu, Helen Meng
Improved End-to-End Dysarthric Speech Recognition via Meta-learning Based Model Re-initialization.
ISCSLP
(2021)
Disong Wang, Songxiang Liu, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion.
Interspeech
(2021)
Disong Wang, Liqun Deng, Yu Ting Yeung, Xiao Chen, Xunying Liu, Helen Meng
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-Shot Voice Conversion.
Interspeech
(2021)
Disong Wang, Liqun Deng, Yu Ting Yeung, Xiao Chen, Xunying Liu, Helen Meng
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion.
CoRR
(2021)
Xixin Wu, Yuewen Cao, Hui Lu, Songxiang Liu, Disong Wang, Zhiyong Wu, Xunying Liu, Helen Meng
Speech Emotion Recognition Using Sequential Capsule Networks.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Disong Wang, Liqun Deng, Yang Zhang, Nianzu Zheng, Yu Ting Yeung, Xiao Chen, Xunying Liu, Helen Meng
Fcl-Taco2: Towards Fast, Controllable and Lightweight Text-to-Speech Synthesis.
ICASSP
(2021)
Disong Wang, Jianwei Yu, Xixin Wu, Songxiang Liu, Lifa Sun, Xunying Liu, Helen Meng
End-To-End Voice Conversion Via Cross-Modal Knowledge Distillation for Dysarthric Speech Reconstruction.
ICASSP
(2020)
Songxiang Liu, Disong Wang, Yuewen Cao, Lifa Sun, Xixin Wu, Shiyin Kang, Zhiyong Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng
End-To-End Accent Conversion Without Using Native Utterances.
ICASSP
(2020)
Songxiang Liu, Yuewen Cao, Disong Wang, Xixin Wu, Xunying Liu, Helen Meng
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling.
CoRR
(2020)
Disong Wang, Songxiang Liu, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion.
CoRR
(2020)
Disong Wang, Yuexian Zou
Joint Noise and Reverberation Adaptive Learning for Robust Speaker DOA Estimation with an Acoustic Vector Sensor.
Interspeech
(2018)
Disong Wang, Yuexian Zou, Wenwu Wang
Learning soft mask with DNN and DNN-SVM for multi-speaker DOA estimation using an acoustic vector sensor.
J. Frankl. Inst.
355 (4) (2018)
Yuexian Zou, Rongzhi Gu, Disong Wang, Aimin Jiang, Christian H. Ritz
Learning a robust DOA estimation model with acoustic vector sensor cues.
APSIPA
(2017)
Xiangyi Li, Yingjie Xu, Hui Cui, Tao Huang, Disong Wang, Baofeng Lian, Wei Li, Guangrong Qin, Lanming Chen, Lu Xie
Prediction of synergistic anti-cancer drug combinations based on drug target network and drug induced gene expression profiles.
Artif. Intell. Medicine
83 (2017)
Xiansheng Guo, Baocang Li, Lei Chu, Disong Wang
Near-field source localization in complex indoor environment using uniform circular array.
ChinaSIP
(2014)