Login / Signup
Kazuki Shimada
ORCID
Publication Activity (10 Years)
Years Active: 2017-2024
Publications (10 Years): 32
Top Topics
Shot Detection
Speech Enhancement
Nonnegative Matrix Factorization
Variational Bayes
Top Venues
CoRR
ICASSP
WASPAA
DCASE
</>
Publications
</>
Kazuki Shimada
,
Kengo Uchida
,
Yuichiro Koyama
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
,
Tatsuya Kawahara
Zero- and Few-Shot Sound Event Localization and Detection.
ICASSP
(2024)
Hao Shi
,
Kazuki Shimada
,
Masato Hirano
,
Takashi Shibuya
,
Yuichiro Koyama
,
Zhi Zhong
,
Shusuke Takahashi
,
Tatsuya Kawahara
,
Yuki Mitsufuji
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders.
ICASSP
(2024)
Yuhta Takida
,
Yukara Ikemiya
,
Takashi Shibuya
,
Kazuki Shimada
,
Woosung Choi
,
Chieh-Hsin Lai
,
Naoki Murata
,
Toshimitsu Uesaka
,
Kengo Uchida
,
Wei-Hsiang Liao
,
Yuki Mitsufuji
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes.
CoRR
(2024)
Yuhta Takida
,
Yukara Ikemiya
,
Takashi Shibuya
,
Kazuki Shimada
,
Woosung Choi
,
Chieh-Hsin Lai
,
Naoki Murata
,
Toshimitsu Uesaka
,
Kengo Uchida
,
Wei-Hsiang Liao
,
Yuki Mitsufuji
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes.
Trans. Mach. Learn. Res.
2024 (2024)
Zhi Zhong
,
Masato Hirano
,
Kazuki Shimada
,
Kazuya Tateishi
,
Shusuke Takahashi
,
Yuki Mitsufuji
An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification.
ICASSP
(2023)
Kazuki Shimada
,
Archontis Politis
,
Parthasaarathy Sudarsanam
,
Daniel Aleksander Krause
,
Kengo Uchida
,
Sharath Adavanne
,
Aapo Hakala
,
Yuichiro Koyama
,
Naoya Takahashi
,
Shusuke Takahashi
,
Tuomas Virtanen
,
Yuki Mitsufuji
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
NeurIPS
(2023)
Masato Hirano
,
Kazuki Shimada
,
Yuichiro Koyama
,
Shusuke Takahashi
,
Yuki Mitsufuji
Diffusion-based Signal Refiner for Speech Separation.
CoRR
(2023)
Zhi Zhong
,
Masato Hirano
,
Kazuki Shimada
,
Kazuya Tateishi
,
Shusuke Takahashi
,
Yuki Mitsufuji
An Attention-based Approach to Hierarchical Multi-label Music Instrument Classification.
CoRR
(2023)
Zhi Zhong
,
Hao Shi
,
Masato Hirano
,
Kazuki Shimada
,
Kazuya Tateishi
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
Extending Audio Masked Autoencoders toward Audio Restoration.
WASPAA
(2023)
Hao Shi
,
Kazuki Shimada
,
Masato Hirano
,
Takashi Shibuya
,
Yuichiro Koyama
,
Zhi Zhong
,
Shusuke Takahashi
,
Tatsuya Kawahara
,
Yuki Mitsufuji
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders.
CoRR
(2023)
Zhi Zhong
,
Hao Shi
,
Masato Hirano
,
Kazuki Shimada
,
Kazuya Tateishi
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
Extending Audio Masked Autoencoders Toward Audio Restoration.
CoRR
(2023)
Kazuki Shimada
,
Kengo Uchida
,
Yuichiro Koyama
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
,
Tatsuya Kawahara
Zero- and Few-shot Sound Event Localization and Detection.
CoRR
(2023)
Kazuki Shimada
,
Archontis Politis
,
Parthasaarathy Sudarsanam
,
Daniel Krause
,
Kengo Uchida
,
Sharath Adavanne
,
Aapo Hakala
,
Yuichiro Koyama
,
Naoya Takahashi
,
Shusuke Takahashi
,
Tuomas Virtanen
,
Yuki Mitsufuji
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
CoRR
(2023)
Yuichiro Koyama
,
Kazuhide Shigemi
,
Masafumi Takahashi
,
Kazuki Shimada
,
Naoya Takahashi
,
Emiru Tsunoo
,
Shusuke Takahashi
,
Yuki Mitsufuji
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.
ICASSP
(2022)
Archontis Politis
,
Kazuki Shimada
,
Parthasaarathy Sudarsanam
,
Sharath Adavanne
,
Daniel Krause
,
Yuichiro Koyama
,
Naoya Takahashi
,
Shusuke Takahashi
,
Yuki Mitsufuji
,
Tuomas Virtanen
STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
DCASE
(2022)
Kazuki Shimada
,
Taishi Sawabe
,
Hidehiko Shishido
,
Masayuki Kanbara
,
Itaru Kitahara
Video Generation Unconsciously Evoking Pre-Motion to Passengers in Automated Vehicles.
ISMAR Adjunct
(2022)
Kazuki Shimada
,
Yuichiro Koyama
,
Shusuke Takahashi
,
Naoya Takahashi
,
Emiru Tsunoo
,
Yuki Mitsufuji
Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training.
ICASSP
(2022)
Archontis Politis
,
Kazuki Shimada
,
Parthasaarathy Sudarsanam
,
Sharath Adavanne
,
Daniel Krause
,
Yuichiro Koyama
,
Naoya Takahashi
,
Shusuke Takahashi
,
Yuki Mitsufuji
,
Tuomas Virtanen
STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events.
CoRR
(2022)
Ricardo Falcón Pérez
,
Kazuki Shimada
,
Yuichiro Koyama
,
Shusuke Takahashi
,
Yuki Mitsufuji
Spatial Mixup: Directional Loudness Modification as Data Augmentation for Sound Event Localization and Detection.
ICASSP
(2022)
Kazuki Shimada
,
Yuichiro Koyama
,
Shusuke Takahashi
,
Naoya Takahashi
,
Emiru Tsunoo
,
Yuki Mitsufuji
Multi-ACCDOA: Localizing and Detecting Overlapping Sounds from the Same Class with Auxiliary Duplicating Permutation Invariant Training.
CoRR
(2021)
Kazuki Shimada
,
Naoya Takahashi
,
Yuichiro Koyama
,
Shusuke Takahashi
,
Emiru Tsunoo
,
Masafumi Takahashi
,
Yuki Mitsufuji
Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection.
CoRR
(2021)
Yuichiro Koyama
,
Kazuhide Shigemi
,
Masafumi Takahashi
,
Kazuki Shimada
,
Naoya Takahashi
,
Emiru Tsunoo
,
Shusuke Takahashi
,
Yuki Mitsufuji
Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.
CoRR
(2021)
Kazuki Shimada
,
Yuichiro Koyama
,
Naoya Takahashi
,
Shusuke Takahashi
,
Yuki Mitsufuji
Accdoa: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection.
ICASSP
(2021)
Ricardo Falcon-Perez
,
Kazuki Shimada
,
Yuichiro Koyama
,
Shusuke Takahashi
,
Yuki Mitsufuji
Spatial mixup: Directional loudness modification as data augmentation for sound event localization and detection.
CoRR
(2021)
Kazuki Shimada
,
Yuichiro Koyama
,
Naoya Takahashi
,
Shusuke Takahashi
,
Yuki Mitsufuji
ACCDOA: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization and Detection.
CoRR
(2020)
Kazuki Shimada
,
Yuichiro Koyama
,
Akira Inoue
Metric Learning with Background Noise Class for Few-Shot Detection of Rare Sound Events.
ICASSP
(2020)
Kazuki Shimada
,
Naoya Takahashi
,
Shusuke Takahashi
,
Yuki Mitsufuji
Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3net.
CoRR
(2020)
Kazuki Shimada
,
Yoshiaki Bando
,
Masato Mimura
,
Katsutoshi Itoyama
,
Kazuyoshi Yoshii
,
Tatsuya Kawahara
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process.
27 (5) (2019)
Kazuki Shimada
,
Yoshiaki Bando
,
Masato Mimura
,
Katsutoshi Itoyama
,
Kazuyoshi Yoshii
,
Tatsuya Kawahara
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition.
CoRR
(2019)
Kazuki Shimada
,
Yuichiro Koyama
,
Akira Inoue
Metric Learning with Background Noise Class for Few-shot Detection of Rare Sound Events.
CoRR
(2019)
Kazuki Shimada
,
Yoshiaki Bando
,
Masato Mimura
,
Katsutoshi Itoyama
,
Kazuyoshi Yoshii
,
Tatsuya Kawahara
Unsupervised Beamforming Based on Multichannel Nonnegative Matrix Factorization for Noisy Speech Recognition.
ICASSP
(2018)
Masato Mimura
,
Yoshiaki Bando
,
Kazuki Shimada
,
Shinsuke Sakai
,
Kazuyoshi Yoshii
,
Tatsuya Kawahara
Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition.
INTERSPEECH
(2017)