Login / Signup
Xixin Wu
ORCID
Publication Activity (10 Years)
Years Active: 2012-2024
Publications (10 Years): 134
Top Topics
Speaker Verification
Speech Synthesis
Neural Network
Autoregressive
Top Venues
CoRR
ICASSP
INTERSPEECH
IEEE ACM Trans. Audio Speech Lang. Process.
</>
Publications
</>
Weiqin Li
,
Peiji Yang
,
Yicheng Zhong
,
Yixuan Zhou
,
Zhisheng Wang
,
Zhiyong Wu
,
Xixin Wu
,
Helen Meng
Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models.
CoRR
(2024)
Yuejiao Wang
,
Xianmin Gong
,
Lingwei Meng
,
Xixin Wu
,
Helen Meng
Large Language Model-based FMRI Encoding of Language Functions for Subjects with Neurocognitive Disorder.
CoRR
(2024)
Xueyuan Chen
,
Yuejiao Wang
,
Xixin Wu
,
Disong Wang
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction.
CoRR
(2024)
Boshi Tang
,
Zhiyong Wu
,
Xixin Wu
,
Qiaochu Huang
,
Jun Chen
,
Shun Lei
,
Helen Meng
SimCalib: Graph Neural Network Calibration Based on Similarity between Nodes.
AAAI
(2024)
Yuejiao Wang
,
Xixin Wu
,
Disong Wang
,
Lingwei Meng
,
Helen Meng
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization.
CoRR
(2024)
Xiaohan Feng
,
Xixin Wu
,
Helen Meng
Injecting Linguistic Knowledge Into BERT for Dialogue State Tracking.
IEEE Access
12 (2024)
Yuejiao Wang
,
Xixin Wu
,
Disong Wang
,
Lingwei Meng
,
Helen Meng
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization.
ICASSP
(2024)
Haohan Guo
,
Fenglong Xie
,
Dongchao Yang
,
Hui Lu
,
Xixin Wu
,
Helen Meng
Addressing Index Collapse of Large-Codebook Speech Tokenizer with Dual-Decoding Product-Quantized Variational Auto-Encoder.
CoRR
(2024)
Tianhua Zhang
,
Kun Li
,
Hongyin Luo
,
Xixin Wu
,
James R. Glass
,
Helen Meng
Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers.
CoRR
(2024)
Lingwei Meng
,
Long Zhou
,
Shujie Liu
,
Sanyuan Chen
,
Bing Han
,
Shujie Hu
,
Yanqing Liu
,
Jinyu Li
,
Sheng Zhao
,
Xixin Wu
,
Helen Meng
,
Furu Wei
Autoregressive Speech Synthesis without Vector Quantization.
CoRR
(2024)
Xueyuan Chen
,
Xi Wang
,
Shaofei Zhang
,
Lei He
,
Zhiyong Wu
,
Xixin Wu
,
Helen Meng
Stylespeech: Self-Supervised Style Enhancing with VQ-VAE-Based Pre-Training for Expressive Audiobook Speech Synthesis.
ICASSP
(2024)
Xueyuan Chen
,
Yuejiao Wang
,
Xixin Wu
,
Disong Wang
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction.
ICASSP
(2024)
Dongchao Yang
,
Dingdong Wang
,
Haohan Guo
,
Xueyuan Chen
,
Xixin Wu
,
Helen Meng
SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models.
CoRR
(2024)
Dongchao Yang
,
Haohan Guo
,
Yuanyuan Wang
,
Rongjie Huang
,
Xiang Li
,
Xu Tan
,
Xixin Wu
,
Helen Meng
UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner.
CoRR
(2024)
Jiawen Kang
,
Lingwei Meng
,
Mingyu Cui
,
Haohan Guo
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Cross-Speaker Encoding Network for Multi-Talker Speech Recognition.
ICASSP
(2024)
Xueyuan Chen
,
Dongchao Yang
,
Dingdong Wang
,
Xixin Wu
,
Zhiyong Wu
,
Helen Meng
CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction.
CoRR
(2024)
Zihao Yang
,
Xixin Wu
,
Xindang He
,
Xiaofei Guan
A multiscale analysis-assisted two-stage reduced-order deep learning approach for effective thermal conductivity of arbitrary contrast heterogeneous materials.
Eng. Appl. Artif. Intell.
136 (2024)
Jing Xu
,
Minglin Wu
,
Xixin Wu
,
Helen Meng
Seamless Language Expansion: Enhancing Multilingual Mastery in Self-Supervised Models.
CoRR
(2024)
Jingyan Zhou
,
Kun Li
,
Junan Li
,
Jiawen Kang
,
Minda Hu
,
Xixin Wu
,
Helen Meng
Purple-teaming LLMs with Adversarial Defender Training.
CoRR
(2024)
Hui Lu
,
Xixin Wu
,
Haohan Guo
,
Songxiang Liu
,
Zhiyong Wu
,
Helen Meng
Unifying One-Shot Voice Conversion and Cloning with Disentangled Speech Representations.
ICASSP
(2024)
Jiawen Kang
,
Lingwei Meng
,
Mingyu Cui
,
Haohan Guo
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Cross-Speaker Encoding Network for Multi-Talker Speech Recognition.
CoRR
(2024)
Wenxuan Wu
,
Xueyuan Chen
,
Xixin Wu
,
Haizhou Li
,
Helen Meng
Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy.
CoRR
(2024)
Lingwei Meng
,
Jiawen Kang
,
Yuejiao Wang
,
Zengrui Jin
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System.
CoRR
(2024)
Shun Lei
,
Yixuan Zhou
,
Liyang Chen
,
Dan Luo
,
Zhiyong Wu
,
Xixin Wu
,
Shiyin Kang
,
Tao Jiang
,
Yahui Zhou
,
Yuxing Han
,
Helen Meng
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts.
ICASSP
(2024)
Jinchao Li
,
Xixin Wu
,
Kaitao Song
,
Dongsheng Li
,
Xunying Liu
,
Helen Meng
A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition.
ICASSP
(2023)
Lingwei Meng
,
Jiawen Kang
,
Mingyu Cui
,
Haibin Wu
,
Xixin Wu
,
Helen Meng
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator.
INTERSPEECH
(2023)
Shun Lei
,
Yixuan Zhou
,
Liyang Chen
,
Zhiyong Wu
,
Xixin Wu
,
Shiyin Kang
,
Helen Meng
MSStyleTTS: Multi-Scale Style Modeling With Hierarchical Context Information for Expressive Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Jinchao Li
,
Kaitao Song
,
Junan Li
,
Bo Zheng
,
Dongsheng Li
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection.
CoRR
(2023)
Wen Wu
,
Chao Zhang
,
Xixin Wu
,
Philip C. Woodland
Estimating the Uncertainty in Emotion Class Labels With Utterance-Specific Dirichlet Priors.
IEEE Trans. Affect. Comput.
14 (4) (2023)
Jinchao Li
,
Kaitao Song
,
Junan Li
,
Bo Zheng
,
Dongsheng Li
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Leveraging Pretrained Representations With Task-Related Keywords for Alzheimer's Disease Detection.
ICASSP
(2023)
Lingwei Meng
,
Jiawen Kang
,
Mingyu Cui
,
Haibin Wu
,
Xixin Wu
,
Helen Meng
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator.
CoRR
(2023)
Hongyin Luo
,
Tianhua Zhang
,
Yung-Sung Chuang
,
Yuan Gong
,
Yoon Kim
,
Xixin Wu
,
Helen Meng
,
James R. Glass
Search Augmented Instruction Learning.
EMNLP (Findings)
(2023)
Haohan Guo
,
Fenglong Xie
,
Jiawen Kang
,
Yujia Xiao
,
Xixin Wu
,
Helen Meng
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning.
CoRR
(2023)
HoLam Chung
,
Junan Li
,
Pengfei Liu
,
Wai-Kim Leung
,
Xixin Wu
,
Helen Meng
Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition.
CoRR
(2023)
Dongchao Yang
,
Jinchuan Tian
,
Xu Tan
,
Rongjie Huang
,
Songxiang Liu
,
Xuankai Chang
,
Jiatong Shi
,
Sheng Zhao
,
Jiang Bian
,
Xixin Wu
,
Zhou Zhao
,
Shinji Watanabe
,
Helen Meng
UniAudio: An Audio Foundation Model Toward Universal Audio Generation.
CoRR
(2023)
Tianhua Zhang
,
Jiaxin Ge
,
Hongyin Luo
,
Yung-Sung Chuang
,
Mingye Gao
,
Yuan Gong
,
Xixin Wu
,
Yoon Kim
,
Helen Meng
,
James R. Glass
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning.
CoRR
(2023)
Tianhua Zhang
,
Hongyin Luo
,
Yung-Sung Chuang
,
Wei Fang
,
Luc Gaitskell
,
Thomas Hartvigsen
,
Xixin Wu
,
Danny Fox
,
Helen Meng
,
James R. Glass
Interpretable Unified Language Checking.
CoRR
(2023)
Haohan Guo
,
Fenglong Xie
,
Xixin Wu
,
Frank K. Soong
,
Helen Meng
MSMC-TTS: Multi-Stage Multi-Codebook VQ-VAE Based Neural TTS.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Hongyin Luo
,
Yung-Sung Chuang
,
Yuan Gong
,
Tianhua Zhang
,
Yoon Kim
,
Xixin Wu
,
Danny Fox
,
Helen Meng
,
James R. Glass
SAIL: Search-Augmented Instruction Learning.
CoRR
(2023)
Yuhao Liu
,
Cheng Gong
,
Longbiao Wang
,
Xixin Wu
,
Qiuyu Liu
,
Jianwu Dang
VF-Taco2: Towards Fast and Lightweight Synthesis for Autoregressive Models with Variation Autoencoder and Feature Distillation.
ICASSP
(2023)
Shun Lei
,
Yixuan Zhou
,
Liyang Chen
,
Zhiyong Wu
,
Xixin Wu
,
Shiyin Kang
,
Helen Meng
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis.
CoRR
(2023)
Xiaohan Feng
,
Xixin Wu
,
Helen Meng
Injecting linguistic knowledge into BERT for Dialogue State Tracking.
CoRR
(2023)
Helen Meng
,
Brian Mak
,
Man-Wai Mak
,
Helene H. Fung
,
Xianmin Gong
,
Timothy C. Y. Kwok
,
Xunying Liu
,
Vincent C. T. Mok
,
Patrick C. M. Wong
,
Jean Woo
,
Xixin Wu
,
Ka Ho Wong
,
Sean Shensheng Xu
,
Naijun Zheng
,
Ranzo Huang
,
Jiawen Kang
,
Xiaoquan Ke
,
Junan Li
,
Jinchao Li
,
Yi Wang
Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders.
INTERSPEECH
(2023)
Jingyan Zhou
,
Minda Hu
,
Junan Li
,
Xiaoying Zhang
,
Xixin Wu
,
Irwin King
,
Helen Meng
Rethinking Machine Ethics - Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?
CoRR
(2023)
Lingwei Meng
,
Jiawen Kang
,
Mingyu Cui
,
Yuejiao Wang
,
Xixin Wu
,
Helen Meng
A Sidecar Separator Can Convert a Single-Speaker Speech Recognition System to a Multi-Speaker One.
CoRR
(2023)
Jie Chen
,
Changhe Song
,
Deyi Tuo
,
Xixin Wu
,
Shiyin Kang
,
Zhiyong Wu
,
Helen Meng
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information.
CoRR
(2023)
Hui Lu
,
Xixin Wu
,
Zhiyong Wu
,
Helen Meng
SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody.
ACM Multimedia
(2023)
Boshi Tang
,
Zhiyong Wu
,
Xixin Wu
,
Qiaochu Huang
,
Jun Chen
,
Shun Lei
,
Helen Meng
SimCalib: Graph Neural Network Calibration based on Similarity between Nodes.
CoRR
(2023)
Shun Lei
,
Yixuan Zhou
,
Liyang Chen
,
Dan Luo
,
Zhiyong Wu
,
Xixin Wu
,
Shiyin Kang
,
Tao Jiang
,
Yahui Zhou
,
Yuxing Han
,
Helen Meng
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts.
CoRR
(2023)
Yunxiang Li
,
Pengfei Liu
,
Xixin Wu
,
Helen Meng
PunCantonese: A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts.
INTERSPEECH
(2023)
Xueyuan Chen
,
Xi Wang
,
Shaofei Zhang
,
Lei He
,
Zhiyong Wu
,
Xixin Wu
,
Helen Meng
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis.
CoRR
(2023)
Jinchao Li
,
Xixin Wu
,
Kaitao Song
,
Dongsheng Li
,
Xunying Liu
,
Helen Meng
A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition.
CoRR
(2023)
Lingwei Meng
,
Jiawen Kang
,
Mingyu Cui
,
Yuejiao Wang
,
Xixin Wu
,
Helen Meng
A Sidecar Separator Can Convert A Single-Talker Speech Recognition System to A Multi-Talker One.
ICASSP
(2023)
Xixin Wu
,
Hui Lu
,
Kun Li
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Hiformer: Sequence Modeling Networks With Hierarchical Attention Mechanisms.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Jingbei Li
,
Yi Meng
,
Xixin Wu
,
Zhiyong Wu
,
Jia Jia
,
Helen Meng
,
Qiao Tian
,
Yuping Wang
,
Yuxuan Wang
Inferring Speaking Styles from Multi-modal Conversational Context by Multi-scale Relational Graph Convolutional Networks.
ACM Multimedia
(2022)
Xixin Wu
,
Shoukang Hu
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Neural Architecture Search for Speech Emotion Recognition.
CoRR
(2022)
Disong Wang
,
Songxiang Liu
,
Xixin Wu
,
Hui Lu
,
Lifa Sun
,
Xunying Liu
,
Helen Meng
Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation.
CoRR
(2022)
Hui Lu
,
Disong Wang
,
Xixin Wu
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE.
CoRR
(2022)
Haibin Wu
,
Jiawen Kang
,
Lingwei Meng
,
Yang Zhang
,
Xixin Wu
,
Zhiyong Wu
,
Hung-yi Lee
,
Helen Meng
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion.
Odyssey
(2022)
Jie Chen
,
Changhe Song
,
Deyi Tuo
,
Xixin Wu
,
Shiyin Kang
,
Zhiyong Wu
,
Helen Meng
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information.
INTERSPEECH
(2022)
Wen Wu
,
Chao Zhang
,
Xixin Wu
,
Philip C. Woodland
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors.
CoRR
(2022)
Haohan Guo
,
Fenglong Xie
,
Xixin Wu
,
Hui Lu
,
Helen Meng
Towards High-Quality Neural TTS for Low-Resource Languages by Learning Compact Speech Representations.
CoRR
(2022)
Haohan Guo
,
Hui Lu
,
Xixin Wu
,
Helen Meng
A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS.
INTERSPEECH
(2022)
Haibin Wu
,
Bo Zheng
,
Xu Li
,
Xixin Wu
,
Hung-Yi Lee
,
Helen Meng
Characterizing the Adversarial Vulnerability of Speech self-Supervised Learning.
ICASSP
(2022)
Hang Su
,
Danyang Zhao
,
Long Dang
,
Minglei Li
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
A Multitask Learning Framework for Speaker Change Detection with Content Information from Unsupervised Speech Decomposition.
ICASSP
(2022)
Naijun Zheng
,
Na Li
,
Xixin Wu
,
Lingwei Meng
,
Jiawen Kang
,
Haibin Wu
,
Chao Weng
,
Dan Su
,
Helen Meng
The CUHK-Tencent Speaker Diarization System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge.
ICASSP
(2022)
Haohan Guo
,
Feng-Long Xie
,
Frank K. Soong
,
Xixin Wu
,
Helen Meng
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS.
CoRR
(2022)
Yi Wang
,
Tianzi Wang
,
Zi Ye
,
Lingwei Meng
,
Shoukang Hu
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Exploring linguistic feature and model combination for speech recognition based automatic AD detection.
CoRR
(2022)
Xixin Wu
,
Shoukang Hu
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Neural Architecture Search for Speech Emotion Recognition.
ICASSP
(2022)
Yi Wang
,
Tianzi Wang
,
Zi Ye
,
Lingwei Meng
,
Shoukang Hu
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Exploring linguistic feature and model combination for speech recognition based automatic AD detection.
INTERSPEECH
(2022)
Haibin Wu
,
Lingwei Meng
,
Jiawen Kang
,
Jinchao Li
,
Xu Li
,
Xixin Wu
,
Hung-yi Lee
,
Helen Meng
Spoofing-Aware Speaker Verification by Multi-Level Fusion.
INTERSPEECH
(2022)
Disong Wang
,
Songxiang Liu
,
Xixin Wu
,
Hui Lu
,
Lifa Sun
,
Xunying Liu
,
Helen Meng
Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation.
ICASSP
(2022)
Xueyuan Chen
,
Qiaochu Huang
,
Xixin Wu
,
Zhiyong Wu
,
Helen Meng
HILvoice:Human-in-the-Loop Style Selection for Elder-Facing Speech Synthesis.
ISCSLP
(2022)
HoLam Chung
,
Junan Li
,
Pengfei Liu
,
Wai-Kim Leung
,
Xixin Wu
,
Helen Meng
Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition.
ISCSLP
(2022)
Haibin Wu
,
Lingwei Meng
,
Jiawen Kang
,
Jinchao Li
,
Xu Li
,
Xixin Wu
,
Hung-yi Lee
,
Helen Meng
Spoofing-Aware Speaker Verification by Multi-Level Fusion.
CoRR
(2022)
Haibin Wu
,
Jiawen Kang
,
Lingwei Meng
,
Yang Zhang
,
Xixin Wu
,
Zhiyong Wu
,
Hung-yi Lee
,
Helen Meng
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion.
CoRR
(2022)
Hui Lu
,
Disong Wang
,
Xixin Wu
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE.
SLT
(2022)
Haohan Guo
,
Hui Lu
,
Xixin Wu
,
Helen Meng
A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS.
CoRR
(2022)
Haohan Guo
,
Feng-Long Xie
,
Frank K. Soong
,
Xixin Wu
,
Helen Meng
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS.
INTERSPEECH
(2022)
Kun Li
,
Tianhua Zhang
,
Liping Tang
,
Junan Li
,
Hongyuan Lu
,
Xixin Wu
,
Helen Meng
Grounded Dialogue Generation with Cross-encoding Re-ranker, Grounding Span Prediction, and Passage Dropout.
DialDoc@ACL
(2022)
Naijun Zheng
,
Na Li
,
Xixin Wu
,
Lingwei Meng
,
Jiawen Kang
,
Haibin Wu
,
Chao Weng
,
Dan Su
,
Helen Meng
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge.
CoRR
(2022)
Jixiu Li
,
Yisen Huang
,
Wing Yin Ng
,
Truman Cheng
,
Xixin Wu
,
Qi Dou
,
Helen Meng
,
Pheng-Ann Heng
,
Yunhui Liu
,
Shannon Melissa Chan
,
David Navarro-Alarcon
,
Calvin Sze Hang Ng
,
Philip Wai Yan Chiu
,
Zheng Li
Speech-Vision Based Multi-Modal AI Control of a Magnetic Anchored and Actuated Endoscope.
ROBIO
(2022)
Haibin Wu
,
Bo Zheng
,
Xu Li
,
Xixin Wu
,
Hung-yi Lee
,
Helen Meng
Characterizing the adversarial vulnerability of speech self-supervised learning.
CoRR
(2021)
Qingyun Dou
,
Yiting Lu
,
Potsawee Manakul
,
Xixin Wu
,
Mark J. F. Gales
Attention Forcing for Machine Translation.
CoRR
(2021)
Hui Lu
,
Zhiyong Wu
,
Xixin Wu
,
Xu Li
,
Shiyin Kang
,
Xunying Liu
,
Helen Meng
VAENAR-TTS: Variational Auto-Encoder Based Non-AutoRegressive Text-to-Speech Synthesis.
Interspeech
(2021)
Songxiang Liu
,
Yuewen Cao
,
Disong Wang
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Any-to-Many Voice Conversion With Location-Relative Sequence-to-Sequence Modeling.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Xu Li
,
Xixin Wu
,
Hui Lu
,
Xunying Liu
,
Helen Meng
Channel-Wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks.
Interspeech
(2021)
Xixin Wu
,
Mark J. F. Gales
Should Ensemble Members Be Calibrated?
CoRR
(2021)
Disong Wang
,
Jianwei Yu
,
Xixin Wu
,
Lifa Sun
,
Xunying Liu
,
Helen Meng
Improved End-to-End Dysarthric Speech Recognition via Meta-learning Based Model Re-initialization.
ISCSLP
(2021)
Disong Wang
,
Songxiang Liu
,
Lifa Sun
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion.
Interspeech
(2021)
Xixin Wu
,
Yuewen Cao
,
Hui Lu
,
Songxiang Liu
,
Shiyin Kang
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Exemplar-Based Emotive Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Qingyun Dou
,
Xixin Wu
,
Moquan Wan
,
Yiting Lu
,
Mark J. F. Gales
Deliberation-Based Multi-Pass Speech Synthesis.
Interspeech
(2021)
Hui Lu
,
Zhiyong Wu
,
Xixin Wu
,
Xu Li
,
Shiyin Kang
,
Xunying Liu
,
Helen Meng
VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
CoRR
(2021)
Xixin Wu
,
Yuewen Cao
,
Hui Lu
,
Songxiang Liu
,
Disong Wang
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Speech Emotion Recognition Using Sequential Capsule Networks.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Xu Li
,
Xixin Wu
,
Hui Lu
,
Xunying Liu
,
Helen Meng
Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks.
CoRR
(2021)
Yuewen Cao
,
Songxiang Liu
,
Xixin Wu
,
Shiyin Kang
,
Peng Liu
,
Zhiyong Wu
,
Xunying Liu
,
Dan Su
,
Dong Yu
,
Helen Meng
Code-Switched Speech Synthesis Using Bilingual Phonetic Posteriorgram with Only Monolingual Corpora.
ICASSP
(2020)
Naijun Zheng
,
Xixin Wu
,
Jinghua Zhong
,
Xunying Liu
,
Helen Meng
Speaker-Aware Linear Discriminant Analysis in Speaker Verification.
INTERSPEECH
(2020)
Disong Wang
,
Jianwei Yu
,
Xixin Wu
,
Songxiang Liu
,
Lifa Sun
,
Xunying Liu
,
Helen Meng
End-To-End Voice Conversion Via Cross-Modal Knowledge Distillation for Dysarthric Speech Reconstruction.
ICASSP
(2020)
Kate M. Knill
,
Linlin Wang
,
Yu Wang
,
Xixin Wu
,
Mark J. F. Gales
Non-Native Children's Automatic Speech Recognition: The INTERSPEECH 2020 Shared Task ALTA Systems.
INTERSPEECH
(2020)
Xixin Wu
,
Kate M. Knill
,
Mark J. F. Gales
,
Andrey Malinin
Ensemble Approaches for Uncertainty in Spoken Language Assessment.
INTERSPEECH
(2020)