Fenglong Xie
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 8
Top Topics
Vector Quantized
Hybrid Model
Finite State Transducers
Speech Synthesis
Top Venues
CoRR
ICASSP
Interspeech
IEEE ACM Trans. Audio Speech Lang. Process.
Publications
Haohan Guo, Fenglong Xie, Dongchao Yang, Hui Lu, Xixin Wu, Helen Meng
Addressing Index Collapse of Large-Codebook Speech Tokenizer with Dual-Decoding Product-Quantized Variational Auto-Encoder.
CoRR (2024)
Haohan Guo, Fenglong Xie, Jiawen Kang, Yujia Xiao, Xixin Wu, Helen Meng
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning.
CoRR (2023)
Haohan Guo, Fenglong Xie, Xixin Wu, Frank K. Soong, Helen Meng
MSMC-TTS: Multi-Stage Multi-Codebook VQ-VAE Based Neural TTS.
IEEE ACM Trans. Audio Speech Lang. Process. 31 (2023)
Haohan Guo, Fenglong Xie, Xixin Wu, Hui Lu, Helen Meng
Towards High-Quality Neural TTS for Low-Resource Languages by Learning Compact Speech Representations.
CoRR (2022)
Shilun Lin, Fenglong Xie, Li Meng, Xinhui Li, Li Lu
Triple M: A Practical Text-to-Speech Synthesis System with Multi-Guidance Attention and Multi-Band Multi-Time LPCNet.
Interspeech (2021)
Shilun Lin, Wen-Chao Su, Li Meng, Fenglong Xie, Xinhui Li, Li Lu
Nana-HDR: A Non-attentive Non-autoregressive Hybrid Model for TTS.
CoRR (2021)
Shilun Lin, Fenglong Xie, Xinhui Li, Li Lu
Triple M: A Practical Neural Text-to-Speech System with Multi-Guidance Attention and Multi-Band Multi-Time LPCNet.
CoRR (2021)
Yibin Zheng, Xinhui Li, Fenglong Xie, Li Lu
Improving End-to-End Speech Synthesis with Local Recurrent Neural Network Enhanced Transformer.
ICASSP (2020)