Login / Signup
Atsuhiko Kai
Publication Activity (10 Years)
Years Active: 1992-2023
Publications (10 Years): 17
Top Topics
Speaker Identification
Linear Prediction
Denoising
Speech Recognition
Top Venues
GCCE
INTERSPEECH
APSIPA
ISCSLP
</>
Publications
</>
Shogo Miwa
,
Atsuhiko Kai
Dialect Speech Recognition Modeling using Corpus of Japanese Dialects and Self-Supervised Learning-based Model XLSR.
INTERSPEECH
(2023)
Yoshiki Niimura
,
Jun Takemoto
,
Atsuhiko Kai
,
Seiichi Nakagawa
Attention-based CNN and Relative Phase Feature Modeling for Improved Imagined Speech Recognition.
APSIPA ASC
(2023)
Raufun Nahar
,
Shogo Miwa
,
Atsuhiko Kai
Domain Adaptation with Augmented Data by Deep Neural Network Based Method Using Re-Recorded Speech for Automatic Speech Recognition in Real Environment.
Sensors
22 (24) (2022)
Takumi Kurokawa
,
Atsuhiko Kai
Robust Query-by-example Spoken Term Detection for Unknown Words Using Speech Retrieval-oriented E2E ASR Modeling.
GCCE
(2021)
Ryota Sakai
,
Atsuhiko Kai
,
Seiichi Nakagawa
Classification of Imagined and Heard Speech Using Amplitude Spectrum and Relative Phase of EEG.
LifeTech
(2021)
Takumi Kurokawa
,
Atsuhiko Kai
Retrieval-oriented E2E ASR Modeling for Improved Query-by-example Spoken Term Detection.
APSIPA ASC
(2021)
Raufun Nahar
,
Atsuhiko Kai
Effect of Data Augmentation on DNN-Based VAD for Automatic Speech Recognition in Noisy Environment.
GCCE
(2020)
Takumi Kurokawa
,
Atsuhiko Kai
,
Hiroki Kondo
Effects of End-to-end ASR and Score Fusion Model Learning for Improved Query-by-example Spoken Term Detection.
APSIPA
(2020)
Raufun Nahar
,
Takashi Kawai
,
Atsuhiko Kai
Multi-Condition Training of Denoising Autoencoder by Augmenting Simulated Reverberant Speech Data.
GCCE
(2018)
Yuji Terada
,
Kenta Tamiya
,
Atsuhiko Kai
Investigation of efficient semi-automatic correction method using STD for automatic captioning.
GCCE
(2017)
Bo Ren
,
Longbiao Wang
,
Liang Lu
,
Yuma Ueda
,
Atsuhiko Kai
Combination of bottleneck feature extraction and dereverberation for distant-talking speech recognition.
Multim. Tools Appl.
75 (9) (2016)
Yuma Ueda
,
Longbiao Wang
,
Atsuhiko Kai
,
Xiong Xiao
,
Engsiong Chng
,
Haizhou Li
Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization.
J. Signal Process. Syst.
82 (2) (2016)
Shuji Oishi
,
Tatsuya Matsuba
,
Mitsuaki Makino
,
Atsuhiko Kai
Combining State-level and DNN-based Acoustic Matches for Efficient Spoken Term Detection in NTCIR-12 SpokenQuery&Doc-2 Task.
NTCIR
(2016)
Shuji Oishi
,
Tatsuya Matsuba
,
Mitsuaki Makino
,
Atsuhiko Kai
Combining State-Level Spotting and Posterior-Based Acoustic Match for Improved Query-by-Example Spoken Term Detection.
INTERSPEECH
(2016)
Yuma Ueda
,
Longbiao Wang
,
Atsuhiko Kai
,
Bo Ren
Environment-dependent denoising autoencoder for distant-talking speech recognition.
EURASIP J. Adv. Signal Process.
2015 (2015)
Zhaofeng Zhang
,
Longbiao Wang
,
Atsuhiko Kai
,
Takanori Yamada
,
Weifeng Li
,
Masahiro Iwahashi
Deep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification.
EURASIP J. Audio Speech Music. Process.
2015 (2015)
Bo Ren
,
Longbiao Wang
,
Atsuhiko Kai
,
Zhaofeng Zhang
Speech selection and environmental adaptation for asynchronous speech recognition.
APSIPA
(2015)
Yuta Kawakami
,
Longbiao Wang
,
Atsuhiko Kai
,
Seiichi Nakagawa
Speaker Identification by Combining Various Vocal Tract and Vocal Source Features.
TSD
(2014)
Satoshi Shiota
,
Longbiao Wang
,
Kyohei Odani
,
Atsuhiko Kai
,
Weifeng Li
Distant-talking speech recognition using multi-channel LMS and multiple-step linear prediction.
ISCSLP
(2014)
Ikuya Hirano
,
Kong-Aik Lee
,
Zhaofeng Zhang
,
Longbiao Wang
,
Atsuhiko Kai
Single-sided approach to discriminative PLDA training for text-independent speaker verification without using expanded i-vector.
ISCSLP
(2014)
Longbiao Wang
,
Bo Ren
,
Yuma Ueda
,
Atsuhiko Kai
,
Shunta Teraoka
,
Taku Fukushima
Denoising autoencoder and environment adaptation for distant-talking speech recognition with asynchronous speech recording.
APSIPA
(2014)
Mitsuaki Makino
,
Atsuhiko Kai
Combining Subword and State-level Dissimilarity Measures for Improved Spoken Term Detection in NTCIR-11 SpokenQuery&Doc Task.
NTCIR
(2014)
Yuma Ueda
,
Longbiao Wang
,
Atsuhiko Kai
,
Xiong Xiao
,
Engsiong Chng
,
Haizhou Li
Single-channel dereverberation for distant-talking speech recognition by combining denoising autoencoder and temporal structure normalization.
ISCSLP
(2014)
Zhaofeng Zhang
,
Longbiao Wang
,
Atsuhiko Kai
Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation.
EURASIP J. Audio Speech Music. Process.
2014 (2014)
Mitsuaki Makino
,
Naoki Yamamoto
,
Atsuhiko Kai
Utilizing state-level distance vector representation for improved spoken term detection by text and spoken queries.
INTERSPEECH
(2014)
Naoki Yamamoto
,
Atsuhiko Kai
Using acoustic dissimilarity measures based on state-level distance vector representation for improved spoken term detection.
APSIPA
(2013)
Takanori Yamada
,
Longbiao Wang
,
Atsuhiko Kai
Improvement of distant-talking speaker identification using bottleneck features of DNN.
INTERSPEECH
(2013)
Naoki Yamamoto
,
Atsuhiko Kai
Spoken Term Detection Using Distance-Vector based Dissimilarity Measures and Its Evaluation on the NTCIR-10 SpokenDoc-2 Task.
NTCIR
(2013)
Longbiao Wang
,
Kyohei Odani
,
Atsuhiko Kai
,
Weifeng Li
Speech recognition using blind source separation and dereverberation method for mixed sound of speech and music.
APSIPA
(2013)
Longbiao Wang
,
Zhaofeng Zhang
,
Atsuhiko Kai
Hands-free speaker identification based on spectral subtraction using a multi-channel least mean square approach.
ICASSP
(2013)
Longbiao Wang
,
Kyohei Odani
,
Atsuhiko Kai
Dereverberation and denoising based on generalized spectral subtraction by multi-channel LMS algorithm using a small-scale microphone array.
EURASIP J. Adv. Signal Process.
2012 (2012)
Kyohei Odani
,
Longbiao Wang
,
Atsuhiko Kai
Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment.
INTERSPEECH
(2012)
Longbiao Wang
,
Zhaofeng Zhang
,
Atsuhiko Kai
,
Yoshiki Kishi
Distant-talking speaker identification using a reverberation model with various artificial room impulse responses.
APSIPA
(2012)
Ikuya Hirano
,
Longbiao Wang
,
Atsuhiko Kai
,
Seiichi Nakagawa
On the use of phase information-based joint factor analysis for speaker verification under channel mismatch condition.
APSIPA
(2012)
Zhaofeng Zhang
,
Longbiao Wang
,
Atsuhiko Kai
Dereverberantion based on generalized spectral subtraction for distant-talking speaker recognition.
APSIPA
(2012)
Longbiao Wang
,
Kyohei Odani
,
Atsuhiko Kai
Evaluation of Hands-Free Large Vocabulary Continuous Speech Recognition by Blind Dereverberation Based on Spectral Subtraction by Multi-channel LMS Algorithm.
TSD
(2011)
Noriki Fujiwara
,
Toshihiko Itoh
,
Kenji Araki
,
Atsuhiko Kai
,
Tatsuhiro Konishi
,
Yukihiro Itoh
Spoken language understanding method using confidence measure and dialogue history.
Systems and Computers in Japan
38 (9) (2007)
Toshihiko Itoh
,
Atsuhiko Kai
,
Yukihiro Itoh
,
Tatsuhiro Konishi
An understanding strategy based on plausibility score in recognition history using CSR confidence measure.
INTERSPEECH
(2004)
Toshihiko Itoh
,
Atsuhiko Kai
,
Tatsuhiro Konishi
,
Yukihiro Itoh
Linguistic and acoustic changes of user²s utterances caused by different dialogue situations.
INTERSPEECH
(2002)
Atsuhiko Kai
,
Yukari Nonomura
,
Toshihiko Itoh
,
Tatsuhiro Konishi
,
Yukihiro Itoh
Influence of different dialogue situations on user²s behavior in spoken corrections.
INTERSPEECH
(2002)
Atsuhiko Kai
,
Takahiro Nakano
,
Seiichi Nakagawa
Usability of Browser-Based Pen-Touch/Speech User Interfaces for Form-Based Application in Mobile Environment.
ICMI
(2000)
Atsuhiko Kai
,
Yoshifumi Hirose
,
Seiichi Nakagawa
Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding system.
ICSLP
(1998)
Atsuhiko Kai
,
Seiichi Nakagawa
Comparison of continuous speech recognition systems with unknown-word processing for speech disfluencies.
Systems and Computers in Japan
29 (9) (1998)
Atsuhiko Kai
,
Seiichi Nakagawa
Relationship among Recognition Rate, Rejection Rate and False Alarm Rate in a Spoken Word Recognition System.
IEICE Trans. Inf. Syst.
(6) (1995)
Atsuhiko Kai
,
Seiichi Nakagawa
Investigation on unknown word processing and strategies for spontaneous speech understanding.
EUROSPEECH
(1995)
Atsuhiko Kai
,
Seiichi Nakagawa
Evaluation of unknown word processing in a spoken word recognition system.
ICSLP
(1994)
Seiichi Nakagawa
,
Atsuhiko Kai
A context-free grammar-driven, one-pass HMM-based continuous speech recognition method.
Systems and Computers in Japan
25 (4) (1994)
Atsuhiko Kai
,
Seiichi Nakagawa
A frame-synchronous continuous speech recognition algorithm using a top-down parsing of context-free grammar.
ICSLP
(1992)