Login / Signup
Rui Liu
ORCID
Publication Activity (10 Years)
Years Active: 2016-2024
Publications (10 Years): 60
Top Topics
Context Modeling
Emotion Recognition
Coarse Grained
Speech Synthesis
Top Venues
CoRR
IEEE ACM Trans. Audio Speech Lang. Process.
ICASSP
INTERSPEECH
</>
Publications
</>
Rui Liu
,
Jinhua Zhang
,
Guanglai Gao
Multi-space channel representation learning for mono-to-binaural conversion based audio deepfake detection.
Inf. Fusion
105 (2024)
Rui Liu
,
Haolin Zuo
,
Zheng Lian
,
Xiaofen Xing
,
Björn W. Schuller
,
Haizhou Li
Emotion and Intent Joint Understanding in Multimodal Conversation: A Benchmarking Dataset.
CoRR
(2024)
Rui Liu
,
Yifan Hu
,
Haolin Zuo
,
Zhaojie Luo
,
Longbiao Wang
,
Guanglai Gao
Text-to-Speech for Low-Resource Agglutinative Language With Morphology-Aware Language Model Pre-Training.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Jing Li
,
Yifan Hu
,
Jiulun Fan
,
Haiyan Yu
,
Bin Jia
,
Rui Liu
,
Feng Zhao
Modified suppressed relative entropy fuzzy c-means clustering algorithm.
J. Intell. Fuzzy Syst.
46 (3) (2024)
Rui Liu
,
Berrak Sisman
,
Guanglai Gao
,
Haizhou Li
Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Rui Liu
,
Yifan Hu
,
Yi Ren
,
Xiang Yin
,
Haizhou Li
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling.
AAAI
(2024)
Zheng Lian
,
Haiyang Sun
,
Licai Sun
,
Zhuofan Wen
,
Siyuan Zhang
,
Shun Chen
,
Hao Gu
,
Jinming Zhao
,
Ziyang Ma
,
Xie Chen
,
Jiangyan Yi
,
Rui Liu
,
Kele Xu
,
Bin Liu
,
Erik Cambria
,
Guoying Zhao
,
Björn W. Schuller
,
Jianhua Tao
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition.
CoRR
(2024)
Rui Liu
,
Yifan Hu
,
Yi Ren
,
Xiang Yin
,
Haizhou Li
Generative Expressive Conversational Speech Synthesis.
CoRR
(2024)
Rui Liu
,
Jinhua Zhang
,
Guanglai Gao
,
Haizhou Li
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion.
INTERSPEECH
(2023)
Rui Liu
,
Haolin Zuo
,
De Hu
,
Guanglai Gao
,
Haizhou Li
Explicit Intensity Control for Accented Text-to-speech.
INTERSPEECH
(2023)
Rui Liu
,
Yifan Hu
,
Yi Ren
,
Xiang Yin
,
Haizhou Li
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling.
CoRR
(2023)
De Hu
,
Qintuya Si
,
Rui Liu
,
Feilong Bao
Distributed Sensor Selection for Speech Enhancement With Acoustic Sensor Networks.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Zhaojie Luo
,
Shoufeng Lin
,
Rui Liu
,
Jun Baba
,
Yuichiro Yoshikawa
,
Hiroshi Ishiguro
Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Rui Liu
,
Jiatian Xi
,
Ziyue Jiang
,
Haizhou Li
FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency.
CoRR
(2023)
Haolin Zuo
,
Rui Liu
,
Jinming Zhao
,
Guanglai Gao
,
Haizhou Li
Exploiting Modality-Invariant Feature for Robust Multimodal Emotion Recognition with Missing Modalities.
ICASSP
(2023)
Qi Fan
,
Haolin Zuo
,
Rui Liu
,
Zheng Lian
,
Guanglai Gao
Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Realistic Incomplete Data Scenarios.
CoRR
(2023)
Kailin Liang
,
Bin Liu
,
Yifan Hu
,
Rui Liu
,
Feilong Bao
,
Guanglai Gao
MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset.
CoRR
(2023)
Rui Liu
,
Bin Liu
,
Haizhou Li
Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech.
CoRR
(2023)
Rui Liu
,
Berrak Sisman
,
Björn W. Schuller
,
Guanglai Gao
,
Haizhou Li
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning.
INTERSPEECH
(2022)
Yonghe Wang
,
Rui Liu
,
Feilong Bao
,
Hui Zhang
,
Guanglai Gao
Alignment-Learning Based Single-Step Decoding for Accurate and Fast Non-Autoregressive Speech Recognition.
ICASSP
(2022)
Haolin Zuo
,
Rui Liu
,
Jinming Zhao
,
Guanglai Gao
,
Haizhou Li
Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities.
CoRR
(2022)
Rui Liu
,
Berrak Sisman
,
Guanglai Gao
,
Haizhou Li
Controllable Accented Text-to-Speech Synthesis.
CoRR
(2022)
Kun Zhou
,
Berrak Sisman
,
Rui Liu
,
Haizhou Li
Emotional voice conversion: Theory, databases and ESD.
Speech Commun.
137 (2022)
Yifan Hu
,
Pengkai Yin
,
Rui Liu
,
Feilong Bao
,
Guanglai Gao
MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline.
CoRR
(2022)
Muhan Na
,
Rui Liu
,
Feilong Bao
,
Guanglai Gao
A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.
ICONIP (6)
(2022)
Junchen Lu
,
Berrak Sisman
,
Rui Liu
,
Mingyang Zhang
,
Haizhou Li
Visualtts: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over.
ICASSP
(2022)
Yifan Hu
,
Pengkai Yin
,
Rui Liu
,
Feilong Bao
,
Guanglai Gao
MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline.
IALP
(2022)
Rui Liu
,
Haolin Zuo
,
De Hu
,
Guanglai Gao
,
Haizhou Li
Explicit Intensity Control for Accented Text-to-speech.
CoRR
(2022)
Yifan Hu
,
Rui Liu
,
Guanglai Gao
,
Haizhou Li
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis.
CoRR
(2022)
Muhan Na
,
Rui Liu
,
Fei Long
,
Guanglai Gao
A Deep Investigation of RNN and Self-attention for the Cyrillic-Traditional Mongolian Bidirectional Conversion.
CoRR
(2022)
Rui Liu
,
Berrak Sisman
,
Björn W. Schuller
,
Guanglai Gao
,
Haizhou Li
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning.
CoRR
(2022)
Rui Liu
,
Qi Liu
,
Hongxu Zhu
,
Hui Cao
Multistage Deep Transfer Learning for EmIoT-Enabled Human-Computer Interaction.
IEEE Internet Things J.
9 (16) (2022)
Rui Liu
,
Berrak Sisman
,
Guanglai Gao
,
Haizhou Li
Decoding Knowledge Transfer for Neural Text-to-Speech Training.
IEEE ACM Trans. Audio Speech Lang. Process.
30 (2022)
Rui Liu
,
Berrak Sisman
,
Haizhou Li
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability.
CoRR
(2021)
Zhaojie Luo
,
Shoufeng Lin
,
Rui Liu
,
Jun Baba
,
Yuichiro Yoshikawa
,
Hiroshi Ishiguro
Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks.
CoRR
(2021)
Rui Liu
,
Berrak Sisman
,
Feilong Bao
,
Jichen Yang
,
Guanglai Gao
,
Haizhou Li
Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Rui Liu
,
Berrak Sisman
,
Haizhou Li
StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis.
CoRR
(2021)
Junchen Lu
,
Berrak Sisman
,
Rui Liu
,
Mingyang Zhang
,
Haizhou Li
VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over.
CoRR
(2021)
Kun Zhou
,
Berrak Sisman
,
Rui Liu
,
Haizhou Li
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset.
ICASSP
(2021)
Kun Zhou
,
Berrak Sisman
,
Rui Liu
,
Haizhou Li
Emotional Voice Conversion: Theory, Databases and ESD.
CoRR
(2021)
Rui Liu
,
Berrak Sisman
,
Guanglai Gao
,
Haizhou Li
Expressive TTS Training With Frame and Style Reconstruction Loss.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Rui Liu
,
Berrak Sisman
,
Haizhou Li
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability.
Interspeech
(2021)
Aihong Huang
,
Feilong Bao
,
Guanglai Gao
,
Yu Shan
,
Rui Liu
Mongolian emotional speech synthesis based on transfer learning and emotional embedding.
IALP
(2021)
Rui Liu
,
Berrak Sisman
,
Yixing Lin
,
Haizhou Li
FastTalker: A neural text-to-speech architecture with shallow and group autoregression.
Neural Networks
141 (2021)
Rui Liu
,
Berrak Sisman
,
Haizhou Li
Graphspeech: Syntax-Aware Graph Attention Network for Neural Speech Synthesis.
ICASSP
(2021)
Rui Liu
,
Berrak Sisman
,
Guanglai Gao
,
Haizhou Li
Expressive TTS Training with Frame and Style Reconstruction Loss.
CoRR
(2020)
Rui Liu
,
Berrak Sisman
,
Feilong Bao
,
Guanglai Gao
,
Haizhou Li
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss.
CoRR
(2020)
Rui Liu
,
Berrak Sisman
,
Feilong Bao
,
Guanglai Gao
,
Haizhou Li
Modeling Prosodic Phrasing With Multi-Task Learning in Tacotron-Based TTS.
IEEE Signal Process. Lett.
27 (2020)
Rui Liu
,
Berrak Sisman
,
Jingdong Li
,
Feilong Bao
,
Guanglai Gao
,
Haizhou Li
Teacher-Student Training For Robust Tacotron-Based TTS.
ICASSP
(2020)
Kun Zhou
,
Berrak Sisman
,
Rui Liu
,
Haizhou Li
Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset.
CoRR
(2020)
Rui Liu
,
Berrak Sisman
,
Haizhou Li
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis.
CoRR
(2020)
Rui Liu
,
Berrak Sisman
,
Feilong Bao
,
Guanglai Gao
,
Haizhou Li
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss.
Odyssey
(2020)
Rui Liu
,
Berrak Sisman
,
Feilong Bao
,
Guanglai Gao
,
Haizhou Li
Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS.
CoRR
(2020)
Rui Liu
,
Berrak Sisman
,
Jingdong Li
,
Feilong Bao
,
Guanglai Gao
,
Haizhou Li
Teacher-Student Training for Robust Tacotron-based TTS.
CoRR
(2019)
Rui Liu
,
Feilong Bao
,
Guanglai Gao
Building Mongolian TTS Front-End with Encoder-Decoder Model by Using Bridge Method and Multi-view Features.
ICONIP (5)
(2019)
Rui Liu
,
Feilong Bao
,
Guanglai Gao
,
Hui Zhang
,
Yonghe Wang
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
INTERSPEECH
(2018)
Jingdong Li
,
Hui Zhang
,
Rui Liu
,
Xueliang Zhang
,
Feilong Bao
End-to-End Mongolian Text-to-Speech System.
ISCSLP
(2018)
Rui Liu
,
Feilong Bao
,
Guanglai Gao
,
Hui Zhang
,
Yonghe Wang
A LSTM Approach with Sub-Word Embeddings for Mongolian Phrase Break Prediction.
COLING
(2018)
Rui Liu
,
Feilong Bao
,
Guanglai Gao
,
Hui Zhang
,
Yonghe Wang
Phonologically Aware BiLSTM Model for Mongolian Phrase Break Prediction with Attention Mechanism.
PRICAI (1)
(2018)
Rui Liu
,
Feilong Bao
,
Guanglai Gao
,
Weihua Wang
Mongolian prosodic phrase prediction using suffix segmentation.
IALP
(2016)