Login / Signup
Kai Zhen
ORCID
Publication Activity (10 Years)
Years Active: 2011-2024
Publications (10 Years): 23
Top Topics
Discriminative Training
Speech Recognition
Neural Network
Lightweight
Top Venues
CoRR
ICASSP
INTERSPEECH
IEEE Signal Process. Lett.
</>
Publications
</>
Yunxiang Jiang
,
Qing Xu
,
Kai Zhen
,
Yu Chen
Quantitative Evaluation of driver's situation awareness in virtual driving through Eye tracking analysis.
CoRR
(2024)
Yifan Yang
,
Kai Zhen
,
Ershad Banijamal
,
Athanasios Mouchtaris
,
Zheng Zhang
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning.
CoRR
(2024)
Rupak Vignesh Swaminathan
,
Grant P. Strimel
,
Ariya Rastrow
,
Sri Harish Mallidi
,
Kai Zhen
,
Hieu Duy Nguyen
,
Nathan Susanj
,
Athanasios Mouchtaris
Max-Margin Transducer Loss: Improving Sequence-Discriminative Training Using a Large-Margin Learning Strategy.
ICASSP
(2024)
Martin Radfar
,
Paulina Lyskawa
,
Brandon Trujillo
,
Yi Xie
,
Kai Zhen
,
Jahn Heymann
,
Denis Filimonov
,
Grant P. Strimel
,
Nathan Susanj
,
Athanasios Mouchtaris
Conmer: Streaming Conformer Without Self-attention for Interactive Voice Assistants.
INTERSPEECH
(2023)
Kai Zhen
,
Hieu Duy Nguyen
,
Raviteja Chinta
,
Nathan Susanj
,
Athanasios Mouchtaris
,
Tariq Afzal
,
Ariya Rastrow
Sub-8-Bit Quantization Aware Training for 8-Bit Neural Network Accelerator with On-Device Speech Recognition.
CoRR
(2022)
Kai Zhen
,
Martin Radfar
,
Hieu Duy Nguyen
,
Grant P. Strimel
,
Nathan Susanj
,
Athanasios Mouchtaris
Sub-8-Bit Quantization for On-Device Speech Recognition: A Regularization-Free Approach.
SLT
(2022)
Kai Zhen
,
Hieu Duy Nguyen
,
Raviteja Chinta
,
Nathan Susanj
,
Athanasios Mouchtaris
,
Tariq Afzal
,
Ariya Rastrow
Sub-8-Bit Quantization Aware Training for 8-Bit Neural Network Accelerator with On-Device Speech Recognition.
INTERSPEECH
(2022)
Kai Zhen
,
Jongmo Sung
,
Mi Suk Lee
,
Seungkwon Beack
,
Minje Kim
Scalable and Efficient Neural Speech Coding: A Hybrid Design.
IEEE ACM Trans. Audio Speech Lang. Process.
30 (2022)
Kai Zhen
,
Martin Radfar
,
Hieu Duy Nguyen
,
Grant P. Strimel
,
Nathan Susanj
,
Athanasios Mouchtaris
Sub-8-bit quantization for on-device speech recognition: a regularization-free approach.
CoRR
(2022)
Kai Zhen
,
Hieu Duy Nguyen
,
Feng-Ju Chang
,
Athanasios Mouchtaris
,
Ariya Rastrow
Sparsification via Compressed Sensing for Automatic Speech Recognition.
ICASSP
(2021)
Kai Zhen
,
Hieu Duy Nguyen
,
Feng-Ju Chang
,
Athanasios Mouchtaris
,
Ariya Rastrow
Sparsification via Compressed Sensing for Automatic Speech Recognition.
CoRR
(2021)
Kai Zhen
,
Mi Suk Lee
,
Jongmo Sung
,
Seungkwon Beack
,
Minje Kim
Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding.
CoRR
(2021)
Haici Yang
,
Kai Zhen
,
Seungkwon Beack
,
Minje Kim
Source-Aware Neural Speech Coding for Noisy Speech Compression.
ICASSP
(2021)
Kai Zhen
,
Jongmo Sung
,
Mi Suk Lee
,
Seungkwon Beack
,
Minje Kim
Scalable and Efficient Neural Speech Coding.
CoRR
(2021)
Kai Zhen
,
Mi Suk Lee
,
Jongmo Sung
,
Seungkwon Beack
,
Minje Kim
Efficient And Scalable Neural Residual Waveform Coding With Collaborative Quantization.
CoRR
(2020)
Kai Zhen
,
Mi Suk Lee
,
Jongmo Sung
,
Seungkwon Beack
,
Minje Kim
Efficient and Scalable Neural Residual Waveform Coding with Collaborative Quantization.
ICASSP
(2020)
Kai Zhen
,
Mi Suk Lee
,
Minje Kim
A Dual-Staged Context Aggregation Method towards Efficient End-to-End Speech Enhancement.
ICASSP
(2020)
Kai Zhen
,
Mi Suk Lee
,
Jongmo Sung
,
Seungkwon Beack
,
Minje Kim
Psychoacoustic Calibration of Loss Functions for Efficient End-to-End Neural Audio Coding.
IEEE Signal Process. Lett.
27 (2020)
Kai Zhen
,
Jongmo Sung
,
Mi Suk Lee
,
Seungkwon Beack
,
Minje Kim
Cascaded Cross-Module Residual Learning Towards Lightweight End-to-End Speech Coding.
INTERSPEECH
(2019)
Kai Zhen
,
Mi Suk Lee
,
Minje Kim
Efficient Context Aggregation for End-to-End Speech Enhancement Using a Densely Connected Convolutional and Recurrent Network.
CoRR
(2019)
Kai Zhen
,
Jongmo Sung
,
Mi Suk Lee
,
Seungkwon Beack
,
Minje Kim
Cascaded Cross-Module Residual Learning towards Lightweight End-to-End Speech Coding.
CoRR
(2019)
Kai Zhen
,
Aswin Sivaraman
,
Jongmo Sung
,
Minje Kim
On Psychoacoustically Weighted Cost Functions Towards Resource-Efficient Deep Neural Networks for Speech Denoising.
CoRR
(2018)
Kai Zhen
,
Mridul Birla
,
David J. Crandall
,
Bingjing Zhang
,
Judy Qiu
Hybrid Supervised-unsupervised Image Topic Visualization with Convolutional Neural Network and LDA.
CoRR
(2017)
Liang Bao
,
Qian Li
,
Kai Zhen
,
Wei Xiang
,
Ping Chen
A functional flavor of service composition.
FSKD
(2011)