Login / Signup
Takashi Shibuya
ORCID
Publication Activity (10 Years)
Years Active: 2009-2024
Publications (10 Years): 28
Top Topics
Audio Files
Speech Enhancement
Cross Modal
Variational Bayes
Top Venues
CoRR
ICASSP
ACL (1)
ACL (Findings)
</>
Publications
</>
Kazuki Shimada
,
Kengo Uchida
,
Yuichiro Koyama
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
,
Tatsuya Kawahara
Zero- and Few-Shot Sound Event Localization and Detection.
ICASSP
(2024)
Akio Hayakawa
,
Masato Ishii
,
Takashi Shibuya
,
Yuki Mitsufuji
Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation.
CoRR
(2024)
Koichi Saito
,
Dongjun Kim
,
Takashi Shibuya
,
Chieh-Hsin Lai
,
Zhi Zhong
,
Yuhta Takida
,
Yuki Mitsufuji
SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation.
CoRR
(2024)
Junyoung Seo
,
Kazumi Fukuda
,
Takashi Shibuya
,
Takuya Narihira
,
Naoki Murata
,
Shoukang Hu
,
Chieh-Hsin Lai
,
Seungryong Kim
,
Yuki Mitsufuji
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping.
CoRR
(2024)
Hao Shi
,
Kazuki Shimada
,
Masato Hirano
,
Takashi Shibuya
,
Yuichiro Koyama
,
Zhi Zhong
,
Shusuke Takahashi
,
Tatsuya Kawahara
,
Yuki Mitsufuji
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders.
ICASSP
(2024)
Mengjie Zhao
,
Junya Ono
,
Zhi Zhong
,
Chieh-Hsin Lai
,
Yuhta Takida
,
Naoki Murata
,
Wei-Hsiang Liao
,
Takashi Shibuya
,
Hiromi Wakaki
,
Yuki Mitsufuji
On the Language Encoder of Contrastive Cross-modal Models.
ACL (Findings)
(2024)
Yuhta Takida
,
Yukara Ikemiya
,
Takashi Shibuya
,
Kazuki Shimada
,
Woosung Choi
,
Chieh-Hsin Lai
,
Naoki Murata
,
Toshimitsu Uesaka
,
Kengo Uchida
,
Wei-Hsiang Liao
,
Yuki Mitsufuji
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes.
CoRR
(2024)
Yuhta Takida
,
Yukara Ikemiya
,
Takashi Shibuya
,
Kazuki Shimada
,
Woosung Choi
,
Chieh-Hsin Lai
,
Naoki Murata
,
Toshimitsu Uesaka
,
Kengo Uchida
,
Wei-Hsiang Liao
,
Yuki Mitsufuji
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes.
Trans. Mach. Learn. Res.
2024 (2024)
Marco Comunità
,
Zhi Zhong
,
Akira Takahashi
,
Shiqi Yang
,
Mengjie Zhao
,
Koichi Saito
,
Yukara Ikemiya
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond.
CoRR
(2024)
Kengo Uchida
,
Takashi Shibuya
,
Yuhta Takida
,
Naoki Murata
,
Shusuke Takahashi
,
Yuki Mitsufuji
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training.
CoRR
(2024)
Yuhta Takida
,
Masaaki Imaizumi
,
Takashi Shibuya
,
Chieh-Hsin Lai
,
Toshimitsu Uesaka
,
Naoki Murata
,
Yuki Mitsufuji
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer.
ICLR
(2024)
Takashi Shibuya
,
Yuhta Takida
,
Yuki Mitsufuji
BIGVSAN: Enhancing Gan-Based Neural Vocoders with Slicing Adversarial Network.
ICASSP
(2024)
Shiqi Yang
,
Zhi Zhong
,
Mengjie Zhao
,
Shusuke Takahashi
,
Masato Ishii
,
Takashi Shibuya
,
Yuki Mitsufuji
Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation.
CoRR
(2024)
Ryosuke Sawata
,
Naoki Murata
,
Yuhta Takida
,
Toshimitsu Uesaka
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement.
INTERSPEECH
(2023)
Dong-Ho Lee
,
Akshen Kadakia
,
Brihi Joshi
,
Aaron Chan
,
Ziyi Liu
,
Kiran Narahari
,
Takashi Shibuya
,
Ryosuke Mitani
,
Toshiyuki Sekiya
,
Jay Pujara
,
Xiang Ren
XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models.
ACL (demo)
(2023)
Takashi Shibuya
,
Yuhta Takida
,
Yuki Mitsufuji
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network.
CoRR
(2023)
Zhi Zhong
,
Hao Shi
,
Masato Hirano
,
Kazuki Shimada
,
Kazuya Tateishi
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
Extending Audio Masked Autoencoders toward Audio Restoration.
WASPAA
(2023)
Hao Shi
,
Kazuki Shimada
,
Masato Hirano
,
Takashi Shibuya
,
Yuichiro Koyama
,
Zhi Zhong
,
Shusuke Takahashi
,
Tatsuya Kawahara
,
Yuki Mitsufuji
Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders.
CoRR
(2023)
Zhi Zhong
,
Hao Shi
,
Masato Hirano
,
Kazuki Shimada
,
Kazuya Tateishi
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
Extending Audio Masked Autoencoders Toward Audio Restoration.
CoRR
(2023)
Mengjie Zhao
,
Junya Ono
,
Zhi Zhong
,
Chieh-Hsin Lai
,
Yuhta Takida
,
Naoki Murata
,
Wei-Hsiang Liao
,
Takashi Shibuya
,
Hiromi Wakaki
,
Yuki Mitsufuji
On the Language Encoder of Contrastive Cross-modal Models.
CoRR
(2023)
Kazuki Shimada
,
Kengo Uchida
,
Yuichiro Koyama
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
,
Tatsuya Kawahara
Zero- and Few-shot Sound Event Localization and Detection.
CoRR
(2023)
Yuhta Takida
,
Takashi Shibuya
,
Wei-Hsiang Liao
,
Chieh-Hsin Lai
,
Junki Ohmura
,
Toshimitsu Uesaka
,
Naoki Murata
,
Shusuke Takahashi
,
Toshiyuki Kumakura
,
Yuki Mitsufuji
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization.
ICML
(2022)
Dong-Ho Lee
,
Akshen Kadakia
,
Brihi Joshi
,
Aaron Chan
,
Ziyi Liu
,
Kiran Narahari
,
Takashi Shibuya
,
Ryosuke Mitani
,
Toshiyuki Sekiya
,
Jay Pujara
,
Xiang Ren
XMD: An End-to-End Framework for Interactive Explanation-Based Debugging of NLP Models.
CoRR
(2022)
Yuhta Takida
,
Takashi Shibuya
,
Wei-Hsiang Liao
,
Chieh-Hsin Lai
,
Junki Ohmura
,
Toshimitsu Uesaka
,
Naoki Murata
,
Shusuke Takahashi
,
Toshiyuki Kumakura
,
Yuki Mitsufuji
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization.
CoRR
(2022)
Dong-Ho Lee
,
Akshen Kadakia
,
Kangmin Tan
,
Mahak Agarwal
,
Xinyu Feng
,
Takashi Shibuya
,
Ryosuke Mitani
,
Toshiyuki Sekiya
,
Jay Pujara
,
Xiang Ren
Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER.
ACL (1)
(2022)
Ryosuke Sawata
,
Naoki Murata
,
Yuhta Takida
,
Toshimitsu Uesaka
,
Takashi Shibuya
,
Shusuke Takahashi
,
Yuki Mitsufuji
A Versatile Diffusion-based Generative Refiner for Speech Enhancement.
CoRR
(2022)
Takashi Shibuya
,
Eduard H. Hovy
Nested Named Entity Recognition via Second-best Sequence Learning and Decoding.
Trans. Assoc. Comput. Linguistics
8 (2020)
Takashi Shibuya
,
Eduard H. Hovy
Nested Named Entity Recognition via Second-best Sequence Learning and Decoding.
CoRR
(2019)
Takashi Shibuya
,
Mototsugu Abe
,
Masayuki Nishiguchi
Audio fingerprinting robust against reverberation and noise based on quantification of sinusoidality.
ICME
(2013)
Takatsugu Kuriyama
,
Takashi Shibuya
,
Tatsuya Harada
,
Yasuo Kuniyoshi
Learning Interaction Rules through Compression of Sensori-Motor Causality Space.
EpiRob
(2010)
Takashi Shibuya
,
Tatsuya Harada
,
Yasuo Kuniyoshi
Causality quantification and its applications: structuring and modeling of multivariate time series.
KDD
(2009)