Login / Signup
Runnan Li
ORCID
Publication Activity (10 Years)
Years Active: 2016-2023
Publications (10 Years): 32
Top Topics
Speech Emotion Recognition
Autoregressive
Output Layer
Spoken Term Detection
Top Venues
ICASSP
CoRR
INTERSPEECH
ISCSLP
</>
Publications
</>
Zenghao Chai
,
Tianke Zhang
,
Tianyu He
,
Xu Tan
,
Tadas Baltrusaitis
,
HsiangTao Wu
,
Runnan Li
,
Sheng Zhao
,
Chun Yuan
,
Jiang Bian
HiFace: High-Fidelity 3D Face Reconstruction by Learning Static and Dynamic Details.
ICCV
(2023)
Jun Ling
,
Xu Tan
,
Liyang Chen
,
Runnan Li
,
Yuchao Zhang
,
Sheng Zhao
,
Li Song
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation.
IEEE J. Sel. Top. Signal Process.
17 (6) (2023)
Zenghao Chai
,
Tianke Zhang
,
Tianyu He
,
Xu Tan
,
Tadas Baltrusaitis
,
HsiangTao Wu
,
Runnan Li
,
Sheng Zhao
,
Chun Yuan
,
Jiang Bian
HiFace: High-Fidelity 3D Face Reconstruction by Learning Static and Dynamic Details.
CoRR
(2023)
Liyang Chen
,
Zhiyong Wu
,
Runnan Li
,
Weihong Bao
,
Jun Ling
,
Xu Tan
,
Sheng Zhao
VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer.
CoRR
(2023)
Shengmeng Li
,
Luping Liu
,
Zenghao Chai
,
Runnan Li
,
Xu Tan
ERA-Solver: Error-Robust Adams Solver for Fast Sampling of Diffusion Probabilistic Models.
CoRR
(2023)
Liyang Chen
,
Zhiyong Wu
,
Runnan Li
,
Weihong Bao
,
Jun Ling
,
Xu Tan
,
Sheng Zhao
VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer.
ICCV (Workshops)
(2023)
Liyang Chen
,
Zhiyong Wu
,
Jun Ling
,
Runnan Li
,
Xu Tan
,
Sheng Zhao
Transformer-S2A: Robust and Efficient Speech-to-Animation.
ICASSP
(2022)
Jun Ling
,
Xu Tan
,
Liyang Chen
,
Runnan Li
,
Yuchao Zhang
,
Sheng Zhao
,
Li Song
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation.
CoRR
(2022)
Anni Tang
,
Tianyu He
,
Xu Tan
,
Jun Ling
,
Runnan Li
,
Sheng Zhao
,
Li Song
,
Jiang Bian
Memories are One-to-Many Mapping Alleviators in Talking Face Generation.
CoRR
(2022)
Liyang Chen
,
Zhiyong Wu
,
Jun Ling
,
Runnan Li
,
Xu Tan
,
Sheng Zhao
Transformer-S2A: Robust and Efficient Speech-to-Animation.
CoRR
(2021)
Xiangyu Liang
,
Zhiyong Wu
,
Runnan Li
,
Yanqing Liu
,
Sheng Zhao
,
Helen Meng
Enhancing Monotonicity for Robust Autoregressive Transformer TTS.
INTERSPEECH
(2020)
Dongyang Dai
,
Zhiyong Wu
,
Runnan Li
,
Xixin Wu
,
Jia Jia
,
Helen Meng
Learning Discriminative Features from Spectrograms Using Center Loss for Speech Emotion Recognition.
ICASSP
(2019)
Liangqi Liu
,
Zhiyong Wu
,
Runnan Li
,
Jia Jia
,
Helen Meng
Learning Contextual Representation with Convolution Bank and Multi-head Self-attention for Speech Emphasis Detection.
APSIPA
(2019)
Jingbei Li
,
Zhiyong Wu
,
Runnan Li
,
Pengpeng Zhi
,
Song Yang
,
Helen Meng
Knowledge-Based Linguistic Encoding for End-to-End Mandarin Text-to-Speech Synthesis.
INTERSPEECH
(2019)
Hui Lu
,
Zhiyong Wu
,
Dongyang Dai
,
Runnan Li
,
Shiyin Kang
,
Jia Jia
,
Helen Meng
One-Shot Voice Conversion with Global Speaker Embeddings.
INTERSPEECH
(2019)
Runnan Li
,
Zhiyong Wu
,
Jia Jia
,
Yaohua Bu
,
Sheng Zhao
,
Helen Meng
Towards Discriminative Representation Learning for Speech Emotion Recognition.
IJCAI
(2019)
Hui Lu
,
Zhiyong Wu
,
Runnan Li
,
Shiyin Kang
,
Jia Jia
,
Helen Meng
A Compact Framework for Voice Conversion Using Wavenet Conditioned on Phonetic Posteriorgrams.
ICASSP
(2019)
Runnan Li
,
Zhiyong Wu
,
Jia Jia
,
Sheng Zhao
,
Helen Meng
Dilated Residual Network with Multi-head Self-attention for Speech Emotion Recognition.
ICASSP
(2019)
Shaoguang Mao
,
Zhiyong Wu
,
Runnan Li
,
Xu Li
,
Helen Meng
,
Lianhong Cai
Applying Multitask Learning to Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech.
ICASSP
(2018)
Jingbei Li
,
Zhiyong Wu
,
Runnan Li
,
Mingxing Xu
,
Kehua Lei
,
Lianhong Cai
Multi-modal Multi-scale Speech Expression Evaluation in Computer-Assisted Language Learning.
AIMS
(2018)
Shaoguang Mao
,
Zhiyong Wu
,
Xu Li
,
Runnan Li
,
Xixin Wu
,
Helen Meng
Integrating Articulatory Features into Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech.
ICME
(2018)
Runnan Li
,
Zhiyong Wu
,
Yuchen Huang
,
Jia Jia
,
Helen Meng
,
Lianhong Cai
Emphatic Speech Generation with Conditioned Input Layer and Bidirectional LSTMS for Expressive Speech Synthesis.
ICASSP
(2018)
Runnan Li
,
Zhiyong Wu
,
Jia Jia
,
Jingbei Li
,
Wei Chen
,
Helen Meng
Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs.
ACM Multimedia
(2018)
Ziwei Zhu
,
Zhiyong Wu
,
Runnan Li
,
Helen Meng
,
Lianhong Cai
Siamese Recurrent Auto-Encoder Representation for Query-by-Example Spoken Term Detection.
INTERSPEECH
(2018)
Ziwei Zhu
,
Zhiyong Wu
,
Runnan Li
,
Yishuang Ning
,
Helen Meng
Learning Frame-Level Recurrent Neural Networks Representations for Query-by-Example Spoken Term Detection on Mobile Devices.
AIMS
(2018)
Long Zhang
,
Jia Jia
,
Fanbo Meng
,
Suping Zhou
,
Wei Chen
,
Cunjun Zhang
,
Runnan Li
Emphasis Detection for Voice Dialogue Applications Using Multi-channel Convolutional Bidirectional Long Short-Term Memory Network.
ISCSLP
(2018)
Yishuang Ning
,
Jia Jia
,
Zhiyong Wu
,
Runnan Li
,
Yongsheng An
,
Yanfeng Wang
,
Helen M. Meng
Multi-Task Deep Learning for User Intention Understanding in Speech Interaction Systems.
AAAI
(2017)
Runnan Li
,
Zhiyong Wu
,
Yishuang Ning
,
Lifa Sun
,
Helen Meng
,
Lianhong Cai
Spectro-Temporal Modelling with Time-Frequency LSTM and Structured Output Layer for Voice Conversion.
INTERSPEECH
(2017)
Yishuang Ning
,
Zhiyong Wu
,
Runnan Li
,
Jia Jia
,
Mingxing Xu
,
Helen M. Meng
,
Lianhong Cai
Learning cross-lingual knowledge with multilingual BLSTM for emphasis detection with limited training data.
ICASSP
(2017)
Yuchen Huang
,
Zhiyong Wu
,
Runnan Li
,
Helen Meng
,
Lianhong Cai
Multi-Task Learning for Prosodic Structure Generation Using BLSTM RNN with Structured Output Layer.
INTERSPEECH
(2017)
Runnan Li
,
Zhiyong Wu
,
Xunying Liu
,
Helen M. Meng
,
Lianhong Cai
Multi-task learning of structured output layer bidirectional LSTMS for speech synthesis.
ICASSP
(2017)
Runnan Li
,
Zhiyong Wu
,
Helen M. Meng
,
Lianhong Cai
DBLSTM-based multi-task learning for pitch transformation in voice conversion.
ISCSLP
(2016)