​
Login / Signup
Xugang Lu
ORCID
Publication Activity (10 Years)
Years Active: 1999-2024
Publications (10 Years): 111
Top Topics
Distance Metric Learning
Acoustic Models
Neural Network
Speech Enhancement
Top Venues
CoRR
INTERSPEECH
ICASSP
ISCSLP
</>
Publications
</>
Ruiteng Zhang
,
Jianguo Wei
,
Xugang Lu
,
Wenhuan Lu
,
Di Jin
,
Lin Zhang
,
Junhai Xu
Unsupervised Adaptive Speaker Recognition by Coupling-Regularized Optimal Transport.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-Based ASR.
ICASSP
(2024)
Cho-Yuan Lee
,
Kuan-Chen Wang
,
Kai-Chun Liu
,
Xugang Lu
,
Ping-Chen Yeh
,
Yu Tsao
A Non-Intrusive Neural Quality Assessment Model for Surface Electromyography Signals.
CoRR
(2024)
Ruiteng Zhang
,
Jianguo Wei
,
Xugang Lu
,
Yongwei Li
,
Wenhuan Lu
,
Di Jin
,
Junhai Xu
Self-Supervised Domain Exploration with an Optimal Transport Regularization for Open Set Cross-Domain Speech Emotion Recognition.
ICASSP
(2024)
Yuxuan Li
,
Jianguo Wei
,
Qiang Fang
,
Xugang Lu
Evaluation of an Improved Ultrasonic Imaging Helmet for Observing Articulatory Data.
ICASSP
(2024)
Wenhao Yang
,
Jianguo Wei
,
Wenhuan Lu
,
Lei Li
,
Xugang Lu
Robust Channel Learning for Large-Scale Radio Speaker Verification.
CoRR
(2024)
Yang Liu
,
Haoqin Sun
,
Geng Chen
,
Qingyue Wang
,
Zhen Zhao
,
Xugang Lu
,
Longbiao Wang
Multi-Level Knowledge Distillation for Speech Emotion Recognition in Noisy Conditions.
CoRR
(2023)
Kai Li
,
Xugang Lu
,
Masato Akagi
,
Masashi Unoki
Contributions of Jitter and Shimmer in the Voice for Fake Audio Detection.
IEEE Access
11 (2023)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Cross-Modal Alignment With Optimal Transport For CTC-Based ASR.
ASRU
(2023)
Ruiteng Zhang
,
Jianguo Wei
,
Xugang Lu
,
Wenhuan Lu
,
Di Jin
,
Lin Zhang
,
Junhai Xu
,
Jianwu Dang
TMS: Temporal multi-scale in time-delay neural network for speaker verification.
Appl. Intell.
53 (22) (2023)
Ruiteng Zhang
,
Jianguo Wei
,
Xugang Lu
,
Yongwei Li
,
Junhai Xu
,
Di Jin
,
Jianhua Tao
SOT: Self-supervised Learning-Assisted Optimal Transport for Unsupervised Adaptive Speech Emotion Recognition.
INTERSPEECH
(2023)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Cross-modal Alignment with Optimal Transport for CTC-based ASR.
CoRR
(2023)
Peng Shen
,
Xugang Lu
,
Hisashi Kawai
Speaker Mask Transformer for Multi-talker Overlapped Speech Recognition.
CoRR
(2023)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Neural domain alignment for spoken language recognition based on optimal transport.
CoRR
(2023)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR.
CoRR
(2023)
Yang Liu
,
Haoqin Sun
,
Geng Chen
,
Qingyue Wang
,
Zhen Zhao
,
Xugang Lu
,
Longbiao Wang
Multi-Level Knowledge Distillation for Speech Emotion Recognition in Noisy Conditions.
INTERSPEECH
(2023)
Kai Li
,
Dung Kim Tran
,
Xugang Lu
,
Masato Akagi
,
Masashi Unoki
Data-driven Non-uniform Filterbanks Based on F-ratio for Machine Anomalous Sound Detection.
EUSIPCO
(2023)
Ruiteng Zhang
,
Jianguo Wei
,
Xugang Lu
,
Wenhuan Lu
,
Di Jin
,
Lin Zhang
,
Yantao Ji
,
Junhai Xu
Self-supervised learning based domain regularization for mask-wearing speaker verification.
Speech Commun.
152 (2023)
Ruiteng Zhang
,
Jianguo Wei
,
Xugang Lu
,
Wenhuan Lu
,
Di Jin
,
Lin Zhang
,
Junhai Xu
Optimal Transport with a Diversified Memory Bank for Cross-Domain Speaker Verification.
ICASSP
(2023)
Tassadaq Hussain
,
Wei-Chien Wang
,
Mandar Gogate
,
Kia Dashtipour
,
Yu Tsao
,
Xugang Lu
,
Ahsan Adeel
,
Amir Hussain
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement.
IEEE Trans. Artif. Intell.
3 (5) (2022)
Ruiteng Zhang
,
Jianguo Wei
,
Wenhuan Lu
,
Lin Zhang
,
Yantao Ji
,
Junhai Xu
,
Xugang Lu
CS-REP: Making Speaker Verification Networks Embracing Re-Parameterization.
ICASSP
(2022)
Tassadaq Hussain
,
Wei-Chien Wang
,
Mandar Gogate
,
Kia Dashtipour
,
Yu Tsao
,
Xugang Lu
,
Ahsan Adeel
,
Amir Hussain
A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement.
CoRR
(2022)
Rong Chao
,
Cheng Yu
,
Szu-Wei Fu
,
Xugang Lu
,
Yu Tsao
Perceptual Contrast Stretching on Target Feature for Speech Enhancement.
CoRR
(2022)
Kai Li
,
Xugang Lu
,
Masato Akagi
,
Jianwu Dang
,
Sheng Li
,
Masashi Unoki
Relationship Between Speakers' Physiological Structure and Acoustic Speech Signals: Data-Driven Study Based on Frequency-Wise Attentional Neural Network.
EUSIPCO
(2022)
Peng Shen
,
Xugang Lu
,
Hisashi Kawai
Transducer-based language embedding for spoken language identification.
INTERSPEECH
(2022)
Kai Li
,
Sheng Li
,
Xugang Lu
,
Masato Akagi
,
Meng Liu
,
Lin Zhang
,
Chang Zeng
,
Longbiao Wang
,
Jianwu Dang
,
Masashi Unoki
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection.
INTERSPEECH
(2022)
Ruiteng Zhang
,
Jianguo Wei
,
Xugang Lu
,
Wenhuan Lu
,
Di Jin
,
Junhai Xu
,
Lin Zhang
,
Yantao Ji
,
Jianwu Dang
TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding.
CoRR
(2022)
Peng Shen
,
Xugang Lu
,
Hisashi Kawai
Pronunciation-Aware Unique Character Encoding for RNN Transducer-Based Mandarin Speech Recognition.
SLT
(2022)
Peng Shen
,
Xugang Lu
,
Hisashi Kawai
Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition.
CoRR
(2022)
Peng Shen
,
Xugang Lu
,
Hisashi Kawai
Transducer-based language embedding for spoken language identification.
CoRR
(2022)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Partial Coupling of Optimal Transport for Spoken Language Identification.
CoRR
(2022)
Rong Chao
,
Cheng Yu
,
Szu-Wei Fu
,
Xugang Lu
,
Yu Tsao
Perceptual Contrast Stretching on Target Feature for Speech Enhancement.
INTERSPEECH
(2022)
Ginji Hayashi
,
Shigeru Katagiri
,
Xugang Lu
,
Miho Ohsaki
An Investigation of Feature Difference Between Child and Adult Voices Using Line Spectral Pairs.
SPML
(2022)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Integrating a joint Bayesian generative model in a discriminative learning framework for speaker verification.
CoRR
(2021)
Hsin-Yi Lin
,
Huan-Hsin Tseng
,
Xugang Lu
,
Yu Tsao
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport.
CoRR
(2021)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification.
CoRR
(2021)
Ruiteng Zhang
,
Jianguo Wei
,
Wenhuan Lu
,
Lin Zhang
,
Yantao Ji
,
Junhai Xu
,
Xugang Lu
CS-Rep: Making Speaker Verification Networks Embracing Re-parameterization.
CoRR
(2021)
Hsin-Yi Lin
,
Huan-Hsin Tseng
,
Xugang Lu
,
Yu Tsao
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport.
NeurIPS
(2021)
Yu-Wen Chen
,
Kuo-Hsuan Hung
,
Shang-Yi Chuang
,
Jonathan Sherman
,
Wen-Chin Huang
,
Xugang Lu
,
Yu Tsao
EMA2S: An End-to-End Multimodal Articulatory-to-Speech System.
ISCAS
(2021)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification.
APSIPA ASC
(2021)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Coupling a Generative Model With a Discriminative Learning Framework for Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Szu-Wei Fu
,
Cheng Yu
,
Tsun-An Hsieh
,
Peter Plantinga
,
Mirco Ravanelli
,
Xugang Lu
,
Yu Tsao
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement.
Interspeech
(2021)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Unsupervised Neural Adaptation Model Based on Optimal Transport for Spoken Language Identification.
ICASSP
(2021)
Tsun-An Hsieh
,
Cheng Yu
,
Szu-Wei Fu
,
Xugang Lu
,
Yu Tsao
Improving Perceptual Quality by Phone-Fortified Perceptual Loss Using Wasserstein Distance for Speech Enhancement.
Interspeech
(2021)
Yu-Wen Chen
,
Kuo-Hsuan Hung
,
Shang-Yi Chuang
,
Jonathan Sherman
,
Wen-Chin Huang
,
Xugang Lu
,
Yu Tsao
EMA2S: An End-to-End Multimodal Articulatory-to-Speech System.
CoRR
(2021)
Szu-Wei Fu
,
Cheng Yu
,
Tsun-An Hsieh
,
Peter Plantinga
,
Mirco Ravanelli
,
Xugang Lu
,
Yu Tsao
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement.
CoRR
(2021)
Yu-Wen Chen
,
Kuo-Hsuan Hung
,
Shang-Yi Chuang
,
Jonathan Sherman
,
Xugang Lu
,
Yu Tsao
A Study of Incorporating Articulatory Movement Information in Speech Enhancement.
EUSIPCO
(2021)
Yen-Ju Lu
,
Chien-Feng Liao
,
Xugang Lu
,
Jeih-weih Hung
,
Yu Tsao
Incorporating Broad Phonetic Information for Speech Enhancement.
CoRR
(2020)
Peng Shen
,
Xugang Lu
,
Sheng Li
,
Hisashi Kawai
Knowledge Distillation-Based Representation Learning for Short-Utterance Spoken Language Identification.
IEEE ACM Trans. Audio Speech Lang. Process.
28 (2020)
Peng Shen
,
Xugang Lu
,
Hisashi Kawai
Investigation of NICT Submission for Short-Duration Speaker Verification Challenge 2020.
INTERSPEECH
(2020)
Cheng Yu
,
Ryandhimas E. Zezario
,
Syu-Siang Wang
,
Jonathan Sherman
,
Yi-Yen Hsieh
,
Xugang Lu
,
Hsin-Min Wang
,
Yu Tsao
Speech Enhancement Based on Denoising Autoencoder With Multi-Branched Encoders.
IEEE ACM Trans. Audio Speech Lang. Process.
28 (2020)
Tsun-An Hsieh
,
Cheng Yu
,
Szu-Wei Fu
,
Xugang Lu
,
Yu Tsao
Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement.
CoRR
(2020)
Cheng Yu
,
Ryandhimas E. Zezario
,
Jonathan Sherman
,
Yi-Yen Hsieh
,
Xugang Lu
,
Hsin-Min Wang
,
Yu Tsao
Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders.
CoRR
(2020)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Unsupervised neural adaptation model based on optimal transport for spoken language identification.
CoRR
(2020)
Yen-Ju Lu
,
Chien-Feng Liao
,
Xugang Lu
,
Jeih-weih Hung
,
Yu Tsao
Incorporating Broad Phonetic Information for Speech Enhancement.
INTERSPEECH
(2020)
Tsun-An Hsieh
,
Hsin-Min Wang
,
Xugang Lu
,
Yu Tsao
WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-End Speech Enhancement.
IEEE Signal Process. Lett.
27 (2020)
Tsun-An Hsieh
,
Hsin-Min Wang
,
Xugang Lu
,
Yu Tsao
WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement.
CoRR
(2020)
Haipeng Sun
,
Rui Wang
,
Kehai Chen
,
Xugang Lu
,
Masao Utiyama
,
Eiichiro Sumita
,
Tiejun Zhao
Robust Unsupervised Neural Machine Translation with Adversarial Denoising Training.
COLING
(2020)
Sheng Li
,
Xugang Lu
,
Raj Dabre
,
Peng Shen
,
Hisashi Kawai
Joint Training End-to-End Speech Recognition Systems with Speaker Attributes.
Odyssey
(2020)
Peng Shen
,
Xugang Lu
,
Komei Sugiura
,
Sheng Li
,
Hisashi Kawai
Compensation on x-vector for Short Utterance Spoken Language Identification.
Odyssey
(2020)
Ryandhimas E. Zezario
,
Tassadaq Hussain
,
Xugang Lu
,
Hsin-Min Wang
,
Yu Tsao
Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement.
ICASSP
(2020)
Xugang Lu
,
Peng Shen
,
Sheng Li
,
Yu Tsao
,
Hisashi Kawai
Deep progressive multi-scale attention for acoustic event classification.
CoRR
(2019)
Chien-Feng Liao
,
Yu Tsao
,
Xugang Lu
,
Hisashi Kawai
Incorporating Symbolic Sequential Modeling for Speech Enhancement.
CoRR
(2019)
Natalie Yu-Hsien Wang
,
Hsiao-Lan Sharon Wang
,
Taowei Wang
,
Szu-Wei Fu
,
Xugang Lu
,
Yu Tsao
,
Hsin-Min Wang
Improving the Intelligibility of Electric and Acoustic Stimulation Speech Using Fully Convolutional Networks Based Speech Enhancement.
CoRR
(2019)
Yuya Tomotoshi
,
David Ha
,
Emilie Delattre
,
Hideyuki Watanabe
,
Xugang Lu
,
Shigeru Katagiri
,
Miho Ohsaki
Optimal Classifier Parameter Status Selection Based on Bayes Boundary-ness for Multi-ProtoType and Multi-Layer Perceptron Classifiers.
IUKM
(2019)
Sheng Li
,
Xugang Lu
,
Chenchen Ding
,
Peng Shen
,
Tatsuya Kawahara
,
Hisashi Kawai
Investigating Radical-Based End-to-End Speech Recognition Systems for Chinese Dialects and Japanese.
INTERSPEECH
(2019)
Sheng Li
,
Raj Dabre
,
Xugang Lu
,
Peng Shen
,
Tatsuya Kawahara
,
Hisashi Kawai
Improving Transformer-Based Speech Recognition Systems with Compressed Structure and Speech Attributes Augmentation.
INTERSPEECH
(2019)
Sheng Li
,
Chenchen Ding
,
Xugang Lu
,
Peng Shen
,
Tatsuya Kawahara
,
Hisashi Kawai
End-to-End Articulatory Attribute Modeling for Low-Resource Multilingual Speech Recognition.
INTERSPEECH
(2019)
Ryandhimas E. Zezario
,
Szu-Wei Fu
,
Xugang Lu
,
Hsin-Min Wang
,
Yu Tsao
Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric.
INTERSPEECH
(2019)
Peng Shen
,
Xugang Lu
,
Sheng Li
,
Hisashi Kawai
Interactive Learning of Teacher-student Model for Short Utterance Spoken Language Identification.
ICASSP
(2019)
Xugang Lu
,
Peng Shen
,
Sheng Li
,
Yu Tsao
,
Hisashi Kawai
Class-Wise Centroid Distance Metric Learning for Acoustic Event Detection.
INTERSPEECH
(2019)
Chien-Feng Liao
,
Yu Tsao
,
Xugang Lu
,
Hisashi Kawai
Incorporating Symbolic Sequential Modeling for Speech Enhancement.
INTERSPEECH
(2019)
Peng Shen
,
Xugang Lu
,
Sheng Li
,
Hisashi Kawai
Feature Representation of Short Utterances Based on Knowledge Distillation for Spoken Language Identification.
INTERSPEECH
(2018)
Szu-Wei Fu
,
Taowei Wang
,
Yu Tsao
,
Xugang Lu
,
Hisashi Kawai
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process.
26 (9) (2018)
Xugang Lu
,
Peng Shen
,
Sheng Li
,
Yu Tsao
,
Hisashi Kawai
Temporal Attentive Pooling for Acoustic Event Detection.
INTERSPEECH
(2018)
Wei-Jen Lee
,
Syu-Siang Wang
,
Fei Chen
,
Xugang Lu
,
Shao-Yi Chien
,
Yu Tsao
Speech Dereverberation Based on Integrated Deep and Ensemble Learning Algorithm.
ICASSP
(2018)
Sheng Li
,
Xugang Lu
,
Ryoichi Takashima
,
Peng Shen
,
Tatsuya Kawahara
,
Hisashi Kawai
Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks.
INTERSPEECH
(2018)
Sheng Li
,
Xugang Lu
,
Ryoichi Takashima
,
Peng Shen
,
Tatsuya Kawahara
,
Hisashi Kawai
Improving Very Deep Time-Delay Neural Network With Vertical-Attention For Effectively Training CTC-Based ASR Systems.
SLT
(2018)
Jianguo Wei
,
Yan Ji
,
Jingshu Zhang
,
Qiang Fang
,
Wenhuan Lu
,
Kiyoshi Honda
,
Xugang Lu
Study of articulators' contribution and compensation during speech by articulatory speech recognition.
Multim. Tools Appl.
77 (14) (2018)
Wei-Jen Lee
,
Syu-Siang Wang
,
Fei Chen
,
Xugang Lu
,
Shao-Yi Chien
,
Yu Tsao
Speech Dereverberation Based on Integrated Deep and Ensemble Learning.
CoRR
(2018)
Ryandhimas E. Zezario
,
Jen-Wei Huang
,
Xugang Lu
,
Yu Tsao
,
Hsin-Te Hwang
,
Hsin-Min Wang
Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement.
APSIPA
(2018)
Peng Shen
,
Xugang Lu
,
Sheng Li
,
Hisashi Kawai
Conditional Generative Adversarial Nets Classifier for Spoken Language Identification.
INTERSPEECH
(2017)
Xugang Lu
,
Peng Shen
,
Yu Tsao
,
Hisashi Kawai
Regularization of neural network model with distance metric learning for i-vector based spoken language identification.
Comput. Speech Lang.
44 (2017)
Naoyuki Kanda
,
Xugang Lu
,
Hisashi Kawai
Maximum-a-Posteriori-Based Decoding for End-to-End Acoustic Models.
IEEE ACM Trans. Audio Speech Lang. Process.
25 (5) (2017)
Naoyuki Kanda
,
Xugang Lu
,
Hisashi Kawai
Minimum Bayes risk training of CTC acoustic models in maximum a posteriori based decoding framework.
ICASSP
(2017)
Szu-Wei Fu
,
Yu Tsao
,
Xugang Lu
,
Hisashi Kawai
Raw Waveform-based Speech Enhancement by Fully Convolutional Networks.
CoRR
(2017)
Szu-Wei Fu
,
Yu Tsao
,
Xugang Lu
,
Hisashi Kawai
Raw waveform-based speech enhancement by fully convolutional networks.
APSIPA
(2017)
Szu-Wei Fu
,
Yu Tsao
,
Xugang Lu
,
Hisashi Kawai
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks.
CoRR
(2017)
Sheng Li
,
Xugang Lu
,
Shinsuke Sakai
,
Masato Mimura
,
Tatsuya Kawahara
Semi-supervised ensemble DNN acoustic model training.
ICASSP
(2017)
Sheng Li
,
Xugang Lu
,
Peng Shen
,
Ryoichi Takashima
,
Tatsuya Kawahara
,
Hisashi Kawai
Incremental training and constructing the very deep convolutional residual network acoustic models.
ASRU
(2017)
Szu-Wei Fu
,
Ting-yao Hu
,
Yu Tsao
,
Xugang Lu
Multi-Metrics Learning for Speech Enhancement.
CoRR
(2017)
Szu-Wei Fu
,
Ting-yao Hu
,
Yu Tsao
,
Xugang Lu
Complex spectrogram enhancement by convolutional neural network with multi-metrics learning.
MLSP
(2017)
Shota Morita
,
Xugang Lu
,
Masashi Unoki
,
Masato Akagi
Method of Estimating Signal-to-Noise Ratio Based on Optimal Design for Sub-band Voice Activity Detection.
J. Inf. Hiding Multim. Signal Process.
8 (6) (2017)
Ying-Hui Lai
,
Fei Chen
,
Syu-Siang Wang
,
Xugang Lu
,
Yu Tsao
,
Chin-Hui Lee
A Deep Denoising Autoencoder Approach to Improving the Intelligibility of Vocoded Speech in Cochlear Implant Simulation.
IEEE Trans. Biomed. Eng.
64 (7) (2017)
Xiaoyun Wang
,
Xugang Lu
,
Hisashi Kawai
,
Seiichi Yamamoto
Contour Analysis Based on Empirical Mode Decomposition for DNN Acoustic Modeling in Mandarin Speech Recognition.
INTERSPEECH
(2016)
Syu-Siang Wang
,
Alan Chern
,
Yu Tsao
,
Jeih-weih Hung
,
Xugang Lu
,
Ying-Hui Lai
,
Borching Su
Wavelet Speech Enhancement Based on Nonnegative Matrix Factorization.
IEEE Signal Process. Lett.
23 (8) (2016)
Peng Shen
,
Xugang Lu
,
Hisashi Kawai
Automatic acoustic segmentation in N-best list rescoring for lecture speech recognition.
ISCSLP
(2016)
Chia-Yung Hsu
,
Ryandhimas E. Zezario
,
Jia-Ching Wang
,
Chin-Wen Ho
,
Xugang Lu
,
Yu Tsao
Incorporating local environment information with ensemble neural networks to robust automatic speech recognition.
ISCSLP
(2016)
Naoyuki Kanda
,
Shoji Harada
,
Xugang Lu
,
Hisashi Kawai
Investigation of Semi-Supervised Acoustic Model Training Based on the Committee of Heterogeneous Neural Networks.
INTERSPEECH
(2016)
Peng Shen
,
Xugang Lu
,
Xinhui Hu
,
Naoyuki Kanda
,
Masahiro Saiko
,
Chiori Hori
,
Hisashi Kawai
Combination of multiple acoustic models with unsupervised adaptation for lecture speech transcription.
Speech Commun.
82 (2016)