Login / Signup
Yukiya Hono
ORCID
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 27
Top Topics
Prosodic Features
Pitman Yor Process
Autoregressive
Spoken Dialogue
Top Venues
CoRR
ICASSP
INTERSPEECH
APSIPA
</>
Publications
</>
Kentaro Mitsui
,
Koh Mitsuda
,
Toshiaki Wakatsuki
,
Yukiya Hono
,
Kei Sawada
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems.
CoRR
(2024)
Yukiya Hono
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model.
ICASSP
(2024)
Kei Sawada
,
Tianyu Zhao
,
Makoto Shing
,
Kentaro Mitsui
,
Akio Kaga
,
Yukiya Hono
,
Toshiaki Wakatsuki
,
Koh Mitsuda
Release of Pre-Trained Models for the Japanese Language.
LREC/COLING
(2024)
Kei Sawada
,
Tianyu Zhao
,
Makoto Shing
,
Kentaro Mitsui
,
Akio Kaga
,
Yukiya Hono
,
Toshiaki Wakatsuki
,
Koh Mitsuda
Release of Pre-Trained Models for the Japanese Language.
CoRR
(2024)
Yukiya Hono
,
Koh Mitsuda
,
Tianyu Zhao
,
Kentaro Mitsui
,
Toshiaki Wakatsuki
,
Kei Sawada
Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition.
ACL (Findings)
(2024)
Yukiya Hono
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model.
CoRR
(2024)
Kentaro Mitsui
,
Yukiya Hono
,
Kei Sawada
Towards human-like spoken dialogue generation between AI agents from written dialogue.
CoRR
(2023)
Miku Nishihara
,
Yukiya Hono
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Singing voice synthesis based on frame-level sequence-to-sequence models considering vocal timing deviation.
CoRR
(2023)
Kentaro Mitsui
,
Yukiya Hono
,
Kei Sawada
UniFLG: Unified Facial Landmark Generator from Text or Speech.
INTERSPEECH
(2023)
Takenori Yoshimura
,
Shinji Takaki
,
Kazuhiro Nakamura
,
Keiichiro Oura
,
Yukiya Hono
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Embedding a Differentiable Mel-Cepstral Synthesis Filter to a Neural Speech Synthesis System.
ICASSP
(2023)
Yukiya Hono
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Singing Voice Synthesis Based on a Musical Note Position-Aware Attention Mechanism.
ICASSP
(2023)
Kentaro Mitsui
,
Yukiya Hono
,
Kei Sawada
UniFLG: Unified Facial Landmark Generator from Text or Speech.
CoRR
(2023)
Yukiya Hono
,
Koh Mitsuda
,
Tianyu Zhao
,
Kentaro Mitsui
,
Toshiaki Wakatsuki
,
Kei Sawada
An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition.
CoRR
(2023)
Takenori Yoshimura
,
Shinji Takaki
,
Kazuhiro Nakamura
,
Keiichiro Oura
,
Yukiya Hono
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System.
CoRR
(2022)
Kentaro Mitsui
,
Tianyu Zhao
,
Kei Sawada
,
Yukiya Hono
,
Yoshihiko Nankaku
,
Keiichi Tokuda
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue.
INTERSPEECH
(2022)
Yukiya Hono
,
Kei Hashimoto
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Singing Voice Synthesis Based on a Musical Note Position-Aware Attention Mechanism.
CoRR
(2022)
Kentaro Mitsui
,
Tianyu Zhao
,
Kei Sawada
,
Yukiya Hono
,
Yoshihiko Nankaku
,
Keiichi Tokuda
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue.
CoRR
(2022)
Yukiya Hono
,
Shinji Takaki
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
PeriodNet: A Non-Autoregressive Raw Waveform Generative Model With a Structure Separating Periodic and Aperiodic Components.
IEEE Access
9 (2021)
Yukiya Hono
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Yukiya Hono
,
Shinji Takaki
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Periodnet: A Non-Autoregressive Waveform Generation Model with a Structure Separating Periodic and Aperiodic Components.
ICASSP
(2021)
Yukiya Hono
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System.
CoRR
(2021)
Yukiya Hono
,
Shinji Takaki
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
PeriodNet: A non-autoregressive waveform generation model with a structure separating periodic and aperiodic components.
CoRR
(2021)
Yukiya Hono
,
Kazuna Tsuboi
,
Kei Sawada
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis.
INTERSPEECH
(2020)
Yukiya Hono
,
Kazuna Tsuboi
,
Kei Sawada
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis.
CoRR
(2020)
Yukiya Hono
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Singing Voice Synthesis Based on Generative Adversarial Networks.
ICASSP
(2019)
Koki Senda
,
Yukiya Hono
,
Kei Sawada
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Singing Voice Conversion Using Posted Waveform Data on Music Social Media.
APSIPA
(2018)
Yukiya Hono
,
Shumma Murata
,
Kazuhiro Nakamura
,
Kei Hashimoto
,
Keiichiro Oura
,
Yoshihiko Nankaku
,
Keiichi Tokuda
Recent Development of the DNN-based Singing Voice Synthesis System - Sinsy.
APSIPA
(2018)