Kentaro Tachibana
Publication Activity (10 Years)
Years Active: 2007-2024
Publications (10 Years): 34
Top Topics
Speech Synthesis
Top Venues
CoRR
INTERSPEECH
ICASSP
IEICE Trans. Inf. Syst.
Publications
Takuto Igarashi, Yuki Saito, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari
Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment.
CoRR
(2024)
Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions.
ICASSP
(2024)
Masaya Kawamura, Ryuichi Yamamoto, Yuma Shirahata, Takuya Hasumi, Kentaro Tachibana
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning.
CoRR
(2024)
Yuki Saito, Takuto Igarashi, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari
SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark.
CoRR
(2024)
Yuki Saito, Shinnosuke Takamichi, Eiji Iimori, Kentaro Tachibana, Hiroshi Saruwatari
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings.
INTERSPEECH
(2023)
Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform.
ICASSP
(2023)
Yuki Saito, Shinnosuke Takamichi, Eiji Iimori, Kentaro Tachibana, Hiroshi Saruwatari
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings.
CoRR
(2023)
Reo Yoneyama, Ryuichi Yamamoto, Kentaro Tachibana
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs.
ICASSP
(2023)
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis.
ICASSP
(2023)
Yuki Saito, Eiji Iimori, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari
CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center.
INTERSPEECH
(2023)
Yuki Saito, Eiji Iimori, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari
CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center.
CoRR
(2023)
Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions.
CoRR
(2023)
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.
CoRR
(2022)
Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform.
CoRR
(2022)
Yuki Saito, Yuto Nishimura, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent.
CoRR
(2022)
Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning.
CoRR
(2022)
Takaaki Saeki, Kentaro Tachibana, Ryuichi Yamamoto
DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning.
INTERSPEECH
(2022)
Yuki Saito, Yuto Nishimura, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent.
INTERSPEECH
(2022)
Reo Yoneyama, Ryuichi Yamamoto, Kentaro Tachibana
Nonparallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs.
CoRR
(2022)
Yuto Nishimura, Yuki Saito, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History.
INTERSPEECH
(2022)
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis.
CoRR
(2022)
Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech.
INTERSPEECH
(2022)
Yuto Nishimura, Yuki Saito, Shinnosuke Takamichi, Kentaro Tachibana, Hiroshi Saruwatari
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History.
CoRR
(2022)
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.
INTERSPEECH
(2022)
Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis.
CoRR
(2021)
Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
Phrase Break Prediction with Bidirectional Encoder Representations in Japanese Text-to-Speech Synthesis.
INTERSPEECH
(2021)
Yuki Saito, Kei Akuzawa, Kentaro Tachibana
Joint Adversarial Training of Speech Recognition and Synthesis Models for Many-to-One Voice Conversion Using Phonetic Posteriorgrams.
IEICE Trans. Inf. Syst.
(9) (2020)
Shunsuke Goto, Kotaro Onishi, Yuki Saito, Kentaro Tachibana, Koichiro Mori
Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image.
INTERSPEECH
(2020)
Takuma Okamoto, Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai
An Investigation of Subband Wavenet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features.
ICASSP
(2018)
Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai
An Investigation of Noise Shaping with Perceptual Weighting for Wavenet-Based Speech Generation.
ICASSP
(2018)
Koichi Hamada, Kentaro Tachibana, Tianqi Li, Hiroto Honda, Yusuke Uchida
Full-Body High-Resolution Anime Generation with Progressive Structure-Conditional Generative Adversarial Networks.
ECCV Workshops (3)
(2018)
Koichi Hamada, Kentaro Tachibana, Tianqi Li, Hiroto Honda, Yusuke Uchida
Full-body High-resolution Anime Generation with Progressive Structure-conditional Generative Adversarial Networks.
CoRR
(2018)
Takuma Okamoto, Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai
Subband wavenet with overlapped single-sideband filterbanks.
ASRU
(2017)
Kentaro Tachibana, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai
Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework.
INTERSPEECH
(2016)
Yu Takahashi, Hiroshi Saruwatari, Yuki Fujihara, Kentaro Tachibana, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka
Source adaptive blind signal extraction using closed-form ICA for hands-free robot spoken dialogue system.
ICASSP
(2009)
Kentaro Tachibana, Hiroshi Saruwatari, Yoshimitsu Mori, Shigeki Miyabe, Kiyohiro Shikano, Akira Tanaka
Efficient Blind Source Separation Combining Closed-Form Second-Order ICA and Nonclosed-Form Higher-Order ICA.
ICASSP (1)
(2007)