​
Login / Signup
Yamato Ohtani
Publication Activity (10 Years)
Years Active: 2006-2024
Publications (10 Years): 11
Top Topics
Fundamental Frequency
Neural Network
Gaussian Mixture
Speech Synthesis
Top Venues
INTERSPEECH
IEICE Trans. Inf. Syst.
CoRR
ICASSP
</>
Publications
</>
Yamato Ohtani
,
Takuma Okamoto
,
Tomoki Toda
,
Hisashi Kawai
FIRNet: Fundamental Frequency Controllable Fast Neural Vocoder With Trainable Finite Impulse Response Filter.
ICASSP
(2024)
Takuma Okamoto
,
Yamato Ohtani
,
Tomoki Toda
,
Hisashi Kawai
Convnext-TTS And Convnext-VC: Convnext-Based Fast End-To-End Sequence-To-Sequence Text-To-Speech And Voice Conversion.
ICASSP
(2024)
Takuma Okamoto
,
Haruki Yamashita
,
Yamato Ohtani
,
Tomoki Toda
,
Hisashi Kawai
WaveNeXt: ConvNeXt-Based Fast Neural Vocoder Without ISTFT layer.
ASRU
(2023)
Daiki Yoshioka
,
Yusuke Yasuda
,
Noriyuki Matsunaga
,
Yamato Ohtani
,
Tomoki Toda
Spoken-Text-Style Transfer with Conditional Variational Autoencoder and Content Word Storage.
INTERSPEECH
(2022)
Yi-Chiao Wu
,
Patrick Lumban Tobing
,
Kazuki Yasuhara
,
Noriyuki Matsunaga
,
Yamato Ohtani
,
Tomoki Toda
A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System.
CoRR
(2022)
Noriyuki Matsunaga
,
Yamato Ohtani
,
Tatsuya Hirahara
Loss Function Considering Multiple Attributes of a Temporal Sequence for Feed-Forward Neural Networks.
IEICE Trans. Inf. Syst.
(12) (2020)
Yi-Chiao Wu
,
Patrick Lumban Tobing
,
Kazuki Yasuhara
,
Noriyuki Matsunaga
,
Yamato Ohtani
,
Tomoki Toda
A Cyclical Post-filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-speech Systems.
CoRR
(2020)
Yi-Chiao Wu
,
Patrick Lumban Tobing
,
Kazuki Yasuhara
,
Noriyuki Matsunaga
,
Yamato Ohtani
,
Tomoki Toda
A Cyclical Post-Filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-Speech Systems.
INTERSPEECH
(2020)
Noriyuki Matsunaga
,
Yamato Ohtani
,
Tatsuya Hirahara
Loss Function Considering Temporal Sequence for Feed-Forward Neural Network-Fundamental Frequency Case.
SSW
(2019)
Yamato Ohtani
,
Masatsune Tamura
,
Masahiro Morita
,
Masami Akamine
Statistical Bandwidth Extension for Speech Synthesis Based on Gaussian Mixture Model with Sub-Band Basis Spectrum Model.
IEICE Trans. Inf. Syst.
(10) (2016)
Yamato Ohtani
,
Koichiro Mori
,
Masahiro Morita
Voice Quality Control Using Perceptual Expressions for Statistical Parametric Speech Synthesis Based on Cluster Adaptive Training.
INTERSPEECH
(2016)
Yamato Ohtani
,
Yu Nasu
,
Masahiro Morita
,
Masami Akamine
Emotional transplant in statistical speech synthesis based on emotion additive model.
INTERSPEECH
(2015)
Yamato Ohtani
,
Masatsune Tamura
,
Masahiro Morita
,
Masami Akamine
GMM-based bandwidth extension using sub-band basis spectrum model.
INTERSPEECH
(2014)
Yamato Ohtani
,
Masatsune Tamura
,
Masahiro Morita
,
Takehiko Kagoshima
,
Masami Akamine
Histogram-based spectral equalization for HMM-based speech synthesis using mel-LSP.
INTERSPEECH
(2012)
Yamato Ohtani
,
Masatsune Tamura
,
Masahiro Morita
,
Takehiko Kagoshima
,
Masami Akamine
HMM-based speech synthesis using sub-band basis spectrum model.
INTERSPEECH
(2012)
Javier Latorre
,
Mark J. F. Gales
,
Sabine Buchholz
,
Kate Knill
,
Masatsune Tamura
,
Yamato Ohtani
,
Masami Akamine
Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?
ICASSP
(2011)
Yamato Ohtani
,
Tomoki Toda
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Non-parallel training for many-to-many eigenvoice conversion.
ICASSP
(2010)
Yamato Ohtani
,
Tomoki Toda
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Improvements of the One-to-Many Eigenvoice Conversion System.
IEICE Trans. Inf. Syst.
(9) (2010)
Kumi Ohta
,
Tomoki Toda
,
Yamato Ohtani
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Adaptive voice-quality control based on one-to-many eigenvoice conversion.
INTERSPEECH
(2010)
Chie Hayashida
,
Tomoki Toda
,
Yamato Ohtani
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Linear transformation approaches to many-to-one voice conversion.
SSW
(2010)
Yamato Ohtani
,
Tomoki Toda
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Adaptive Training for Voice Conversion Based on Eigenvoices.
IEICE Trans. Inf. Syst.
(6) (2010)
Yamato Ohtani
,
Tomoki Toda
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Many-to-many eigenvoice conversion with reference voice.
INTERSPEECH
(2009)
Malorie Charlier
,
Yamato Ohtani
,
Tomoki Toda
,
Alexis Moinet
,
Thierry Dutoit
Cross-language voice conversion based on eigenvoices.
INTERSPEECH
(2009)
Takashi Muramatsu
,
Yamato Ohtani
,
Tomoki Toda
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory.
INTERSPEECH
(2008)
Daisuke Tani
,
Tomoki Toda
,
Yamato Ohtani
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Maximum a posteriori adaptation for many-to-one eigenvoice conversion.
INTERSPEECH
(2008)
Yamato Ohtani
,
Tomoki Toda
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
An improved one-to-many eigenvoice conversion system.
INTERSPEECH
(2008)
Yamato Ohtani
,
Tomoki Toda
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model.
INTERSPEECH
(2007)
Tomoki Toda
,
Yamato Ohtani
,
Kiyohiro Shikano
One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices.
ICASSP (4)
(2007)
Daisuke Tani
,
Yamato Ohtani
,
Tomoki Toda
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
An evaluation of many-to-one voice conversion algorithms with pre-stored speaker data sets.
SSW
(2007)
Kumi Ohta
,
Yamato Ohtani
,
Tomoki Toda
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Regression approaches to voice quality controll based on one-to-many eigenvoice conversion.
SSW
(2007)
Tomoki Toda
,
Yamato Ohtani
,
Kiyohiro Shikano
Eigenvoice conversion based on Gaussian mixture model.
INTERSPEECH
(2006)
Yamato Ohtani
,
Tomoki Toda
,
Hiroshi Saruwatari
,
Kiyohiro Shikano
Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation.
INTERSPEECH
(2006)