Sign in
Cheng-I Lai
Publication Activity (10 Years)
Years Active: 2018-2023
Publications (10 Years): 23
Top Topics
Speech Processing
Natural Language Generation
Speaker Recognition
Vector Quantisation
Top Venues
CoRR
INTERSPEECH
ICASSP
ACL (1)
</>
Publications
</>
Yuan Tseng
,
Cheng-I Lai
,
Hung-yi Lee
Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences.
CoRR
(2023)
Hsiang-Sheng Tsai
,
Heng-Jui Chang
,
Wen-Chin Huang
,
Zili Huang
,
Kushal Lakhotia
,
Shu-Wen Yang
,
Shuyan Dong
,
Andy T. Liu
,
Cheng-I Lai
,
Jiatong Shi
,
Xuankai Chang
,
Phil Hall
,
Hsuan-Jui Chen
,
Shang-Wen Li
,
Shinji Watanabe
,
Abdelrahman Mohamed
,
Hung-yi Lee
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities.
ACL (1)
(2022)
Yuan Gong
,
Cheng-I Lai
,
Yu-An Chung
,
James R. Glass
SSAST: Self-Supervised Audio Spectrogram Transformer.
AAAI
(2022)
Alexander H. Liu
,
Cheng-I Lai
,
Wei-Ning Hsu
,
Michael Auli
,
Alexei Baevski
,
James R. Glass
Simple and Effective Unsupervised Speech Synthesis.
INTERSPEECH
(2022)
Kaizhi Qian
,
Yang Zhang
,
Heting Gao
,
Junrui Ni
,
Cheng-I Lai
,
David D. Cox
,
Mark Hasegawa-Johnson
,
Shiyu Chang
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers.
ICML
(2022)
Kaizhi Qian
,
Yang Zhang
,
Heting Gao
,
Junrui Ni
,
Cheng-I Lai
,
David D. Cox
,
Mark Hasegawa-Johnson
,
Shiyu Chang
Improving Self-Supervised Speech Representations by Disentangling Speakers.
CoRR
(2022)
Yonggan Fu
,
Yang Zhang
,
Kaizhi Qian
,
Zhifan Ye
,
Zhongzhi Yu
,
Cheng-I Lai
,
Yingyan Lin
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing.
CoRR
(2022)
Alexander H. Liu
,
SouYoung Jin
,
Cheng-I Lai
,
Andrew Rouditchenko
,
Aude Oliva
,
James R. Glass
Cross-Modal Discrete Representation Learning.
ACL (1)
(2022)
Cheng-I Lai
,
Yung-Sung Chuang
,
Hung-Yi Lee
,
Shang-Wen Li
,
James R. Glass
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining.
ICASSP
(2021)
Cheng-I Lai
,
Yung-Sung Chuang
,
Hung-yi Lee
,
Shang-wen Li
,
James R. Glass
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining.
CoRR
(2020)
Yi Zhao
,
Haoyu Li
,
Cheng-I Lai
,
Jennifer Williams
,
Erica Cooper
,
Junichi Yamagishi
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction.
INTERSPEECH
(2020)
Erica Cooper
,
Cheng-I Lai
,
Yusuke Yasuda
,
Junichi Yamagishi
Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS?
INTERSPEECH
(2020)
Yi Zhao
,
Haoyu Li
,
Cheng-I Lai
,
Jennifer Williams
,
Erica Cooper
,
Junichi Yamagishi
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction.
CoRR
(2020)
Erica Cooper
,
Cheng-I Lai
,
Yusuke Yasuda
,
Fuming Fang
,
Xin Wang
,
Nanxin Chen
,
Junichi Yamagishi
Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings.
ICASSP
(2020)
Cheng-I Lai
,
Jin Cao
,
Sravan Bodapati
,
Shang-Wen Li
Towards Semi-Supervised Semantics Understanding from Speech.
CoRR
(2020)
Fan-Keng Sun
,
Cheng-I Lai
Conditioned Natural Language Generation using only Unconditioned Language Model: An Exploration.
CoRR
(2020)
Cheng-I Lai
,
Nanxin Chen
,
Jesús Villalba
,
Najim Dehak
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworks.
CoRR
(2019)
Kelly Marchisio
,
Jialiang Guo
,
Cheng-I Lai
,
Philipp Koehn
Controlling the Reading Level of Machine Translation Output.
MTSummit (1)
(2019)
Cheng-I Lai
Contrastive Predictive Coding Based Feature for Automatic Speaker Verification.
CoRR
(2019)
Cheng-I Lai
,
Alberto Abad
,
Korin Richmond
,
Junichi Yamagishi
,
Najim Dehak
,
Simon King
Attentive Filtering Networks for Audio Replay Attack Detection.
ICASSP
(2019)
Cheng-I Lai
,
Nanxin Chen
,
Jesús Villalba
,
Najim Dehak
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual Networks.
INTERSPEECH
(2019)
Phani Sankar Nidadavolu
,
Cheng-I Lai
,
Jesús Villalba
,
Najim Dehak
Investigation on Bandwidth Extension for Speaker Recognition.
INTERSPEECH
(2018)
Cheng-I Lai
,
Alberto Abad
,
Korin Richmond
,
Junichi Yamagishi
,
Najim Dehak
,
Simon King
Attentive Filtering Networks for Audio Replay Attack Detection.
CoRR
(2018)