Yuhta Takida
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 37
Top Topics
Diffusion Models
Gibbs Sampler
Speech Enhancement
Variational Bayes
Top Venues
CoRR
ICASSP
ICML
ICLR
Publications
Koichi Saito, Dongjun Kim, Takashi Shibuya, Chieh-Hsin Lai, Zhi Zhong, Yuhta Takida, Yuki Mitsufuji
SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation.
CoRR
(2024)
Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher.
CoRR
(2024)
Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji
On the Language Encoder of Contrastive Cross-modal Models.
ACL (Findings)
(2024)
Yutong He, Naoki Murata, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Dongjun Kim, Wei-Hsiang Liao, Yuki Mitsufuji, J. Zico Kolter, Ruslan Salakhutdinov, Stefano Ermon
Manifold Preserving Guided Diffusion.
ICLR
(2024)
Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes.
CoRR
(2024)
Yuhta Takida, Yukara Ikemiya, Takashi Shibuya, Kazuki Shimada, Woosung Choi, Chieh-Hsin Lai, Naoki Murata, Toshimitsu Uesaka, Kengo Uchida, Wei-Hsiang Liao, Yuki Mitsufuji
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes.
Trans. Mach. Learn. Res.
2024 (2024)
Kengo Uchida, Takashi Shibuya, Yuhta Takida, Naoki Murata, Shusuke Takahashi, Yuki Mitsufuji
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training.
CoRR
(2024)
Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Yutong He, Yuki Mitsufuji, Stefano Ermon
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion.
ICLR
(2024)
Yuhta Takida, Masaaki Imaizumi, Takashi Shibuya, Chieh-Hsin Lai, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer.
ICLR
(2024)
Takashi Shibuya, Yuhta Takida, Yuki Mitsufuji
BigVSAN: Enhancing GAN-Based Neural Vocoders with Slicing Adversarial Network.
ICASSP
(2024)
Toshimitsu Uesaka, Taiji Suzuki, Yuhta Takida, Chieh-Hsin Lai, Naoki Murata, Yuki Mitsufuji
Understanding Multimodal Contrastive Learning Through Pointwise Mutual Information.
CoRR
(2024)
Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration.
ICML
(2023)
Yuhta Takida, Masaaki Imaizumi, Chieh-Hsin Lai, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji
Adversarially Slicing Generative Networks: Discriminator Slices Feature for One-Dimensional Optimal Transport.
CoRR
(2023)
Keisuke Toyama, Taketo Akama, Yukara Ikemiya, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji
Automatic Piano Transcription with Hierarchical Frequency-Time Transformer.
CoRR
(2023)
Naoki Murata, Koichi Saito, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration.
CoRR
(2023)
Yutong He, Naoki Murata, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Dongjun Kim, Wei-Hsiang Liao, Yuki Mitsufuji, J. Zico Kolter, Ruslan Salakhutdinov, Stefano Ermon
Manifold Preserving Guided Diffusion.
CoRR
(2023)
Ryosuke Sawata, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement.
INTERSPEECH
(2023)
Koichi Saito, Naoki Murata, Toshimitsu Uesaka, Chieh-Hsin Lai, Yuhta Takida, Takao Fukui, Yuki Mitsufuji
Unsupervised Vocal Dereverberation with Diffusion-Based Generative Models.
ICASSP
(2023)
Takashi Shibuya, Yuhta Takida, Yuki Mitsufuji
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network.
CoRR
(2023)
Dongjun Kim, Chieh-Hsin Lai, Wei-Hsiang Liao, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Yutong He, Yuki Mitsufuji, Stefano Ermon
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion.
CoRR
(2023)
Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation.
ICML
(2023)
Keisuke Toyama, Taketo Akama, Yukara Ikemiya, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji
Automatic Piano Transcription With Hierarchical Frequency-Time Transformer.
ISMIR
(2023)
Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Naoki Murata, Yuki Mitsufuji, Stefano Ermon
On the Equivalence of Consistency-Type Models: Consistency Models, Consistent Diffusion Models, and Fokker-Planck Regularization.
CoRR
(2023)
Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji
On the Language Encoder of Contrastive Cross-modal Models.
CoRR
(2023)
Yuhta Takida, Takashi Shibuya, Wei-Hsiang Liao, Chieh-Hsin Lai, Junki Ohmura, Toshimitsu Uesaka, Naoki Murata, Shusuke Takahashi, Toshiyuki Kumakura, Yuki Mitsufuji
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization.
ICML
(2022)
Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Toshimitsu Uesaka, Yuki Mitsufuji, Stefano Ermon
Regularizing Score-based Models with Score Fokker-Planck Equations.
CoRR
(2022)
Koichi Saito, Naoki Murata, Toshimitsu Uesaka, Chieh-Hsin Lai, Yuhta Takida, Takao Fukui, Yuki Mitsufuji
Unsupervised vocal dereverberation with diffusion-based generative models.
CoRR
(2022)
Yuhta Takida, Takashi Shibuya, Wei-Hsiang Liao, Chieh-Hsin Lai, Junki Ohmura, Toshimitsu Uesaka, Naoki Murata, Shusuke Takahashi, Toshiyuki Kumakura, Yuki Mitsufuji
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization.
CoRR
(2022)
Yuhta Takida, Wei-Hsiang Liao, Chieh-Hsin Lai, Toshimitsu Uesaka, Shusuke Takahashi, Yuki Mitsufuji
Preventing oversmoothing in VAE via generalized variance parameterization.
Neurocomputing
509 (2022)
Ryosuke Sawata, Naoki Murata, Yuhta Takida, Toshimitsu Uesaka, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji
A Versatile Diffusion-based Generative Refiner for Speech Enhancement.
CoRR
(2022)
Yuhta Takida, Wei-Hsiang Liao, Toshimitsu Uesaka, Shusuke Takahashi, Yuki Mitsufuji
Preventing Posterior Collapse Induced by Oversmoothing in Gaussian VAE.
CoRR
(2021)
Naoki Murata, Yuhta Takida, Tetsu Magariyachi
Fast Convergent Method for Active Noise Control Over Spatial Region with Causal Constraint.
WASPAA
(2021)
Yu Maeno, Yuhta Takida, Naoki Murata, Yuki Mitsufuji
Array-Geometry-Aware Spatial Active Noise Control Based on Direction-of-Arrival Weighting.
ICASSP
(2020)
Yuhta Takida, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari
Reciprocity gap functional in spherical harmonic domain for gridless sound field decomposition.
Signal Process.
169 (2020)
Yuhta Takida, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari
Robust Gridless Sound Field Decomposition Based on Structured Reciprocity Gap Functional in Spherical Harmonic Domain.
ICASSP
(2019)
Yuhta Takida, Shoichi Koyama, Hiroshi Saruwatari
Exterior and Interior Sound Field Separation Using Convex Optimization: Comparison of Signal Models.
EUSIPCO
(2018)
Yuhta Takida, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari
Gridless Sound Field Decomposition Based on Reciprocity Gap Functional in Spherical Harmonic Domain.
SAM
(2018)