Xixin Wu
Publication Activity (10 Years)
Years Active: 2012-2024
Publications (10 Years): 110
Top Topics
Speech Recognition
Neural Network
Autoregressive
Gaussian Process
Top Venues
CoRR
ICASSP
INTERSPEECH
IEEE ACM Trans. Audio Speech Lang. Process.
Publications
Xueyuan Chen
,
Yuejiao Wang
,
Xixin Wu
,
Disong Wang
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction.
CoRR
(2024)
Yuejiao Wang
,
Xixin Wu
,
Disong Wang
,
Lingwei Meng
,
Helen Meng
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization.
CoRR
(2024)
Jiawen Kang
,
Lingwei Meng
,
Mingyu Cui
,
Haohan Guo
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Cross-Speaker Encoding Network for Multi-Talker Speech Recognition.
CoRR
(2024)
Jinchao Li
,
Xixin Wu
,
Kaitao Song
,
Dongsheng Li
,
Xunying Liu
,
Helen Meng
A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition.
ICASSP
(2023)
Shun Lei
,
Yixuan Zhou
,
Liyang Chen
,
Zhiyong Wu
,
Xixin Wu
,
Shiyin Kang
,
Helen Meng
MSStyleTTS: Multi-Scale Style Modeling With Hierarchical Context Information for Expressive Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Jinchao Li
,
Kaitao Song
,
Junan Li
,
Bo Zheng
,
Dongsheng Li
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection.
CoRR
(2023)
Wen Wu
,
Chao Zhang
,
Xixin Wu
,
Philip C. Woodland
Estimating the Uncertainty in Emotion Class Labels With Utterance-Specific Dirichlet Priors.
IEEE Trans. Affect. Comput.
14 (4) (2023)
Jinchao Li
,
Kaitao Song
,
Junan Li
,
Bo Zheng
,
Dongsheng Li
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Leveraging Pretrained Representations With Task-Related Keywords for Alzheimer's Disease Detection.
ICASSP
(2023)
Lingwei Meng
,
Jiawen Kang
,
Mingyu Cui
,
Haibin Wu
,
Xixin Wu
,
Helen Meng
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator.
CoRR
(2023)
Hongyin Luo
,
Tianhua Zhang
,
Yung-Sung Chuang
,
Yuan Gong
,
Yoon Kim
,
Xixin Wu
,
Helen Meng
,
James R. Glass
Search Augmented Instruction Learning.
EMNLP (Findings)
(2023)
Haohan Guo
,
Fenglong Xie
,
Jiawen Kang
,
Yujia Xiao
,
Xixin Wu
,
Helen Meng
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning.
CoRR
(2023)
HoLam Chung
,
Junan Li
,
Pengfei Liu
,
Wai-Kim Leung
,
Xixin Wu
,
Helen Meng
Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition.
CoRR
(2023)
Dongchao Yang
,
Jinchuan Tian
,
Xu Tan
,
Rongjie Huang
,
Songxiang Liu
,
Xuankai Chang
,
Jiatong Shi
,
Sheng Zhao
,
Jiang Bian
,
Xixin Wu
,
Zhou Zhao
,
Shinji Watanabe
,
Helen Meng
UniAudio: An Audio Foundation Model Toward Universal Audio Generation.
CoRR
(2023)
Tianhua Zhang
,
Jiaxin Ge
,
Hongyin Luo
,
Yung-Sung Chuang
,
Mingye Gao
,
Yuan Gong
,
Xixin Wu
,
Yoon Kim
,
Helen Meng
,
James R. Glass
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning.
CoRR
(2023)
Tianhua Zhang
,
Hongyin Luo
,
Yung-Sung Chuang
,
Wei Fang
,
Luc Gaitskell
,
Thomas Hartvigsen
,
Xixin Wu
,
Danny Fox
,
Helen Meng
,
James R. Glass
Interpretable Unified Language Checking.
CoRR
(2023)
Haohan Guo
,
Fenglong Xie
,
Xixin Wu
,
Frank K. Soong
,
Helen Meng
MSMC-TTS: Multi-Stage Multi-Codebook VQ-VAE Based Neural TTS.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Hongyin Luo
,
Yung-Sung Chuang
,
Yuan Gong
,
Tianhua Zhang
,
Yoon Kim
,
Xixin Wu
,
Danny Fox
,
Helen Meng
,
James R. Glass
SAIL: Search-Augmented Instruction Learning.
CoRR
(2023)
Yuhao Liu
,
Cheng Gong
,
Longbiao Wang
,
Xixin Wu
,
Qiuyu Liu
,
Jianwu Dang
VF-Taco2: Towards Fast and Lightweight Synthesis for Autoregressive Models with Variation Autoencoder and Feature Distillation.
ICASSP
(2023)
Shun Lei
,
Yixuan Zhou
,
Liyang Chen
,
Zhiyong Wu
,
Xixin Wu
,
Shiyin Kang
,
Helen Meng
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis.
CoRR
(2023)
Xiaohan Feng
,
Xixin Wu
,
Helen Meng
Injecting linguistic knowledge into BERT for Dialogue State Tracking.
CoRR
(2023)
Jingyan Zhou
,
Minda Hu
,
Junan Li
,
Xiaoying Zhang
,
Xixin Wu
,
Irwin King
,
Helen Meng
Rethinking Machine Ethics - Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?
CoRR
(2023)
Lingwei Meng
,
Jiawen Kang
,
Mingyu Cui
,
Yuejiao Wang
,
Xixin Wu
,
Helen Meng
A Sidecar Separator Can Convert a Single-Speaker Speech Recognition System to a Multi-Speaker One.
CoRR
(2023)
Jie Chen
,
Changhe Song
,
Deyi Tuo
,
Xixin Wu
,
Shiyin Kang
,
Zhiyong Wu
,
Helen Meng
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information.
CoRR
(2023)
Hui Lu
,
Xixin Wu
,
Zhiyong Wu
,
Helen Meng
SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody.
ACM Multimedia
(2023)
Boshi Tang
,
Zhiyong Wu
,
Xixin Wu
,
Qiaochu Huang
,
Jun Chen
,
Shun Lei
,
Helen Meng
SimCalib: Graph Neural Network Calibration based on Similarity between Nodes.
CoRR
(2023)
Shun Lei
,
Yixuan Zhou
,
Liyang Chen
,
Dan Luo
,
Zhiyong Wu
,
Xixin Wu
,
Shiyin Kang
,
Tao Jiang
,
Yahui Zhou
,
Yuxing Han
,
Helen Meng
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts.
CoRR
(2023)
Xueyuan Chen
,
Xi Wang
,
Shaofei Zhang
,
Lei He
,
Zhiyong Wu
,
Xixin Wu
,
Helen Meng
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis.
CoRR
(2023)
Jinchao Li
,
Xixin Wu
,
Kaitao Song
,
Dongsheng Li
,
Xunying Liu
,
Helen Meng
A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition.
CoRR
(2023)
Lingwei Meng
,
Jiawen Kang
,
Mingyu Cui
,
Yuejiao Wang
,
Xixin Wu
,
Helen Meng
A Sidecar Separator Can Convert A Single-Talker Speech Recognition System to A Multi-Talker One.
ICASSP
(2023)
Xixin Wu
,
Hui Lu
,
Kun Li
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Hiformer: Sequence Modeling Networks With Hierarchical Attention Mechanisms.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Jingbei Li
,
Yi Meng
,
Xixin Wu
,
Zhiyong Wu
,
Jia Jia
,
Helen Meng
,
Qiao Tian
,
Yuping Wang
,
Yuxuan Wang
Inferring Speaking Styles from Multi-modal Conversational Context by Multi-scale Relational Graph Convolutional Networks.
ACM Multimedia
(2022)
Xixin Wu
,
Shoukang Hu
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Neural Architecture Search for Speech Emotion Recognition.
CoRR
(2022)
Disong Wang
,
Songxiang Liu
,
Xixin Wu
,
Hui Lu
,
Lifa Sun
,
Xunying Liu
,
Helen Meng
Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation.
CoRR
(2022)
Hui Lu
,
Disong Wang
,
Xixin Wu
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE.
CoRR
(2022)
Haibin Wu
,
Jiawen Kang
,
Lingwei Meng
,
Yang Zhang
,
Xixin Wu
,
Zhiyong Wu
,
Hung-yi Lee
,
Helen Meng
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion.
Odyssey
(2022)
Jie Chen
,
Changhe Song
,
Deyi Tuo
,
Xixin Wu
,
Shiyin Kang
,
Zhiyong Wu
,
Helen Meng
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information.
INTERSPEECH
(2022)
Wen Wu
,
Chao Zhang
,
Xixin Wu
,
Philip C. Woodland
Estimating the Uncertainty in Emotion Class Labels with Utterance-Specific Dirichlet Priors.
CoRR
(2022)
Haohan Guo
,
Fenglong Xie
,
Xixin Wu
,
Hui Lu
,
Helen Meng
Towards High-Quality Neural TTS for Low-Resource Languages by Learning Compact Speech Representations.
CoRR
(2022)
Haohan Guo
,
Hui Lu
,
Xixin Wu
,
Helen Meng
A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS.
INTERSPEECH
(2022)
Haibin Wu
,
Bo Zheng
,
Xu Li
,
Xixin Wu
,
Hung-Yi Lee
,
Helen Meng
Characterizing the Adversarial Vulnerability of Speech self-Supervised Learning.
ICASSP
(2022)
Hang Su
,
Danyang Zhao
,
Long Dang
,
Minglei Li
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
A Multitask Learning Framework for Speaker Change Detection with Content Information from Unsupervised Speech Decomposition.
ICASSP
(2022)
Naijun Zheng
,
Na Li
,
Xixin Wu
,
Lingwei Meng
,
Jiawen Kang
,
Haibin Wu
,
Chao Weng
,
Dan Su
,
Helen Meng
The CUHK-Tencent Speaker Diarization System for the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge.
ICASSP
(2022)
Haohan Guo
,
Feng-Long Xie
,
Frank K. Soong
,
Xixin Wu
,
Helen Meng
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS.
CoRR
(2022)
Yi Wang
,
Tianzi Wang
,
Zi Ye
,
Lingwei Meng
,
Shoukang Hu
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Exploring linguistic feature and model combination for speech recognition based automatic AD detection.
CoRR
(2022)
Xixin Wu
,
Shoukang Hu
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Neural Architecture Search for Speech Emotion Recognition.
ICASSP
(2022)
Yi Wang
,
Tianzi Wang
,
Zi Ye
,
Lingwei Meng
,
Shoukang Hu
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Exploring linguistic feature and model combination for speech recognition based automatic AD detection.
INTERSPEECH
(2022)
Haibin Wu
,
Lingwei Meng
,
Jiawen Kang
,
Jinchao Li
,
Xu Li
,
Xixin Wu
,
Hung-yi Lee
,
Helen Meng
Spoofing-Aware Speaker Verification by Multi-Level Fusion.
INTERSPEECH
(2022)
Disong Wang
,
Songxiang Liu
,
Xixin Wu
,
Hui Lu
,
Lifa Sun
,
Xunying Liu
,
Helen Meng
Speaker Identity Preservation in Dysarthric Speech Reconstruction by Adversarial Speaker Adaptation.
ICASSP
(2022)
Xueyuan Chen
,
Qiaochu Huang
,
Xixin Wu
,
Zhiyong Wu
,
Helen Meng
HILvoice: Human-in-the-Loop Style Selection for Elder-Facing Speech Synthesis.
ISCSLP
(2022)
HoLam Chung
,
Junan Li
,
Pengfei Liu
,
Wai-Kim Leung
,
Xixin Wu
,
Helen Meng
Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition.
ISCSLP
(2022)
Haibin Wu
,
Lingwei Meng
,
Jiawen Kang
,
Jinchao Li
,
Xu Li
,
Xixin Wu
,
Hung-yi Lee
,
Helen Meng
Spoofing-Aware Speaker Verification by Multi-Level Fusion.
CoRR
(2022)
Haibin Wu
,
Jiawen Kang
,
Lingwei Meng
,
Yang Zhang
,
Xixin Wu
,
Zhiyong Wu
,
Hung-yi Lee
,
Helen Meng
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion.
CoRR
(2022)
Hui Lu
,
Disong Wang
,
Xixin Wu
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE.
SLT
(2022)
Haohan Guo
,
Hui Lu
,
Xixin Wu
,
Helen Meng
A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS.
CoRR
(2022)
Haohan Guo
,
Feng-Long Xie
,
Frank K. Soong
,
Xixin Wu
,
Helen Meng
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS.
INTERSPEECH
(2022)
Kun Li
,
Tianhua Zhang
,
Liping Tang
,
Junan Li
,
Hongyuan Lu
,
Xixin Wu
,
Helen Meng
Grounded Dialogue Generation with Cross-encoding Re-ranker, Grounding Span Prediction, and Passage Dropout.
DialDoc@ACL
(2022)
Naijun Zheng
,
Na Li
,
Xixin Wu
,
Lingwei Meng
,
Jiawen Kang
,
Haibin Wu
,
Chao Weng
,
Dan Su
,
Helen Meng
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge.
CoRR
(2022)
Jixiu Li
,
Yisen Huang
,
Wing Yin Ng
,
Truman Cheng
,
Xixin Wu
,
Qi Dou
,
Helen Meng
,
Pheng-Ann Heng
,
Yunhui Liu
,
Shannon Melissa Chan
,
David Navarro-Alarcon
,
Calvin Sze Hang Ng
,
Philip Wai Yan Chiu
,
Zheng Li
Speech-Vision Based Multi-Modal AI Control of a Magnetic Anchored and Actuated Endoscope.
ROBIO
(2022)
Haibin Wu
,
Bo Zheng
,
Xu Li
,
Xixin Wu
,
Hung-yi Lee
,
Helen Meng
Characterizing the adversarial vulnerability of speech self-supervised learning.
CoRR
(2021)
Qingyun Dou
,
Yiting Lu
,
Potsawee Manakul
,
Xixin Wu
,
Mark J. F. Gales
Attention Forcing for Machine Translation.
CoRR
(2021)
Hui Lu
,
Zhiyong Wu
,
Xixin Wu
,
Xu Li
,
Shiyin Kang
,
Xunying Liu
,
Helen Meng
VAENAR-TTS: Variational Auto-Encoder Based Non-AutoRegressive Text-to-Speech Synthesis.
Interspeech
(2021)
Songxiang Liu
,
Yuewen Cao
,
Disong Wang
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Any-to-Many Voice Conversion With Location-Relative Sequence-to-Sequence Modeling.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Xu Li
,
Xixin Wu
,
Hui Lu
,
Xunying Liu
,
Helen Meng
Channel-Wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks.
Interspeech
(2021)
Xixin Wu
,
Mark J. F. Gales
Should Ensemble Members Be Calibrated?
CoRR
(2021)
Disong Wang
,
Jianwei Yu
,
Xixin Wu
,
Lifa Sun
,
Xunying Liu
,
Helen Meng
Improved End-to-End Dysarthric Speech Recognition via Meta-learning Based Model Re-initialization.
ISCSLP
(2021)
Disong Wang
,
Songxiang Liu
,
Lifa Sun
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion.
Interspeech
(2021)
Xixin Wu
,
Yuewen Cao
,
Hui Lu
,
Songxiang Liu
,
Shiyin Kang
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Exemplar-Based Emotive Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Qingyun Dou
,
Xixin Wu
,
Moquan Wan
,
Yiting Lu
,
Mark J. F. Gales
Deliberation-Based Multi-Pass Speech Synthesis.
Interspeech
(2021)
Hui Lu
,
Zhiyong Wu
,
Xixin Wu
,
Xu Li
,
Shiyin Kang
,
Xunying Liu
,
Helen Meng
VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
CoRR
(2021)
Xixin Wu
,
Yuewen Cao
,
Hui Lu
,
Songxiang Liu
,
Disong Wang
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Speech Emotion Recognition Using Sequential Capsule Networks.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Xu Li
,
Xixin Wu
,
Hui Lu
,
Xunying Liu
,
Helen Meng
Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks.
CoRR
(2021)
Yuewen Cao
,
Songxiang Liu
,
Xixin Wu
,
Shiyin Kang
,
Peng Liu
,
Zhiyong Wu
,
Xunying Liu
,
Dan Su
,
Dong Yu
,
Helen Meng
Code-Switched Speech Synthesis Using Bilingual Phonetic Posteriorgram with Only Monolingual Corpora.
ICASSP
(2020)
Naijun Zheng
,
Xixin Wu
,
Jinghua Zhong
,
Xunying Liu
,
Helen Meng
Speaker-Aware Linear Discriminant Analysis in Speaker Verification.
INTERSPEECH
(2020)
Disong Wang
,
Jianwei Yu
,
Xixin Wu
,
Songxiang Liu
,
Lifa Sun
,
Xunying Liu
,
Helen Meng
End-To-End Voice Conversion Via Cross-Modal Knowledge Distillation for Dysarthric Speech Reconstruction.
ICASSP
(2020)
Kate M. Knill
,
Linlin Wang
,
Yu Wang
,
Xixin Wu
,
Mark J. F. Gales
Non-Native Children's Automatic Speech Recognition: The INTERSPEECH 2020 Shared Task ALTA Systems.
INTERSPEECH
(2020)
Xixin Wu
,
Kate M. Knill
,
Mark J. F. Gales
,
Andrey Malinin
Ensemble Approaches for Uncertainty in Spoken Language Assessment.
INTERSPEECH
(2020)
Xu Li
,
Na Li
,
Jinghua Zhong
,
Xixin Wu
,
Xunying Liu
,
Dan Su
,
Dong Yu
,
Helen Meng
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification.
CoRR
(2020)
Xu Li
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Deep segmental phonetic posterior-grams based discovery of non-categories in L2 English speech.
CoRR
(2020)
Songxiang Liu
,
Disong Wang
,
Yuewen Cao
,
Lifa Sun
,
Xixin Wu
,
Shiyin Kang
,
Zhiyong Wu
,
Xunying Liu
,
Dan Su
,
Dong Yu
,
Helen Meng
End-To-End Accent Conversion Without Using Native Utterances.
ICASSP
(2020)
Xu Li
,
Jinghua Zhong
,
Jianwei Yu
,
Shoukang Hu
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification.
CoRR
(2020)
Songxiang Liu
,
Yuewen Cao
,
Disong Wang
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling.
CoRR
(2020)
Xu Li
,
Jinghua Zhong
,
Jianwei Yu
,
Shoukang Hu
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification.
Odyssey
(2020)
Xu Li
,
Na Li
,
Jinghua Zhong
,
Xixin Wu
,
Xunying Liu
,
Dan Su
,
Dong Yu
,
Helen Meng
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification.
INTERSPEECH
(2020)
Xu Li
,
Jinghua Zhong
,
Xixin Wu
,
Jianwei Yu
,
Xunying Liu
,
Helen Meng
Adversarial Attacks on GMM I-Vector Based Speaker Verification Systems.
ICASSP
(2020)
Disong Wang
,
Songxiang Liu
,
Lifa Sun
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion.
CoRR
(2020)
Jianwei Yu
,
Max W. Y. Lam
,
Xie Chen
,
Shoukang Hu
,
Songxiang Liu
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Recurrent Neural Network Language Model Training Using Natural Gradient.
ICASSP
(2019)
Hang Su
,
Borislav Dzodzo
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Unsupervised Methods for Audio Classification from Lecture Discussion Recordings.
INTERSPEECH
(2019)
Dongyang Dai
,
Zhiyong Wu
,
Runnan Li
,
Xixin Wu
,
Jia Jia
,
Helen Meng
Learning Discriminative Features from Spectrograms Using Center Loss for Speech Emotion Recognition.
ICASSP
(2019)
Yuewen Cao
,
Xixin Wu
,
Songxiang Liu
,
Jianwei Yu
,
Xu Li
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
End-to-end Code-switched TTS with Mix of Monolingual Recordings.
ICASSP
(2019)
Jianwei Yu
,
Max W. Y. Lam
,
Shoukang Hu
,
Xixin Wu
,
Xu Li
,
Yuewen Cao
,
Xunying Liu
,
Helen Meng
Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models.
INTERSPEECH
(2019)
Shoukang Hu
,
Xurong Xie
,
Shansong Liu
,
Max W. Y. Lam
,
Jianwei Yu
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition.
INTERSPEECH
(2019)
Peng Liu
,
Xixin Wu
,
Shiyin Kang
,
Guangzhi Li
,
Dan Su
,
Dong Yu
Maximizing Mutual Information for Tacotron.
CoRR
(2019)
Mu Wang
,
Xixin Wu
,
Zhiyong Wu
,
Shiyin Kang
,
Deyi Tuo
,
Guangzhi Li
,
Dan Su
,
Dong Yu
,
Helen Meng
Quasi-fully Convolutional Neural Network with Variational Inference for Speech Synthesis.
ICASSP
(2019)
Dongyang Dai
,
Zhiyong Wu
,
Shiyin Kang
,
Xixin Wu
,
Jia Jia
,
Dan Su
,
Dong Yu
,
Helen Meng
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT.
INTERSPEECH
(2019)
Xixin Wu
,
Songxiang Liu
,
Yuewen Cao
,
Xu Li
,
Jianwei Yu
,
Dongyang Dai
,
Xi Ma
,
Shoukang Hu
,
Zhiyong Wu
,
Xunying Liu
,
Helen Meng
Speech Emotion Recognition Using Capsule Networks.
ICASSP
(2019)
Xu Li
,
Jinghua Zhong
,
Xixin Wu
,
Jianwei Yu
,
Xunying Liu
,
Helen Meng
Adversarial Attacks on GMM i-vector based Speaker Verification Systems.
CoRR
(2019)
Ming Liao
,
Jing Li
,
Haisong Zhang
,
Lingzhi Wang
,
Xixin Wu
,
Kam-Fai Wong
Coupling Global and Local Context for Unsupervised Aspect Extraction.
EMNLP/IJCNLP (1)
(2019)
Shoukang Hu
,
Max W. Y. Lam
,
Xurong Xie
,
Shansong Liu
,
Jianwei Yu
,
Xixin Wu
,
Xunying Liu
,
Helen Meng
Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition.
ICASSP
(2019)
Songxiang Liu
,
Yuewen Cao
,
Xixin Wu
,
Lifa Sun
,
Xunying Liu
,
Helen Meng
Jointly Trained Conversion Model and WaveNet Vocoder for Non-Parallel Voice Conversion Using Mel-Spectrograms and Phonetic Posteriorgrams.
INTERSPEECH
(2019)
Xixin Wu
,
Yuewen Cao
,
Mu Wang
,
Songxiang Liu
,
Shiyin Kang
,
Zhiyong Wu
,
Xunying Liu
,
Dan Su
,
Dong Yu
,
Helen Meng
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis.
INTERSPEECH
(2018)