Jae-Min Kim
ORCID
Publication Activity (10 Years)
Years Active: 2017-2024
Publications (10 Years): 31
Top Topics
Speech Synthesis
Top Venues
CoRR
ICASSP
INTERSPEECH
Sensors
Publications
Hyun-Wook Yoon, Jin-Seob Kim, Ryuichi Yamamoto, Ryo Terashima, Chan-Ho Song, Jae-Min Kim, Eunwoo Song
Enhancing Multilingual TTS with Voice Conversion Based Data Augmentation and Posterior Embedding.
ICASSP
(2024)
Hoyeon Lee, Hyun-Wook Yoon, Jong-Hwan Kim, Jae-Min Kim
Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model.
INTERSPEECH
(2023)
Hoyeon Lee, Hyun-Wook Yoon, Jong-Hwan Kim, Jae-Min Kim
Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model.
CoRR
(2023)
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis.
ICASSP
(2023)
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.
CoRR
(2022)
Hyun-Wook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems.
INTERSPEECH
(2022)
Hyun-Wook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems.
CoRR
(2022)
Min-Jae Hwang, Hyun-Wook Yoon, Chan-Ho Song, Jin-Seob Kim, Jae-Min Kim, Eunwoo Song
Linear Prediction-based Parallel WaveGAN Speech Synthesis.
ICEIC
(2022)
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis.
CoRR
(2022)
Hwa-Yeon Kim, Jong-Hwan Kim, Jae-Min Kim
Fast Bilingual Grapheme-To-Phoneme Conversion.
NAACL-HLT (Industry Papers)
(2022)
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.
INTERSPEECH
(2022)
Suhyeon Oh, Ohsung Kwon, Min-Jae Hwang, Jae-Min Kim, Eunwoo Song
Effective Data Augmentation Methods for Neural Text-to-Speech Systems.
ICEIC
(2022)
Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder.
CoRR
(2022)
Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim
TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder.
INTERSPEECH
(2022)
Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis.
ICASSP
(2021)
Eunwoo Song, Ryuichi Yamamoto, Min-Jae Hwang, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim
Improved Parallel WaveGAN Vocoder with Perceptually Weighted Spectrogram Loss.
SLT
(2021)
Eunbi Choi, Hwa-Yeon Kim, Jong-Hwan Kim, Jae-Min Kim
Label Embedding for Chinese Grapheme-to-Phoneme Conversion.
INTERSPEECH
(2021)
Hwa-Yeon Kim, Jong-Hwan Kim, Jae-Min Kim
NN-KOG2P: A Novel Grapheme-to-Phoneme Model for Korean Language.
ICASSP
(2021)
Eunwoo Song, Ryuichi Yamamoto, Min-Jae Hwang, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim
Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss.
CoRR
(2021)
Ryuichi Yamamoto, Eunwoo Song, Min-Jae Hwang, Jae-Min Kim
Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators.
ICASSP
(2021)
Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model.
INTERSPEECH
(2021)
Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
Parallel WaveGAN: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram.
ICASSP
(2020)
Eunwoo Song, Min-Jae Hwang, Ryuichi Yamamoto, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim
Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder.
INTERSPEECH
(2020)
Ryuichi Yamamoto, Eunwoo Song, Min-Jae Hwang, Jae-Min Kim
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators.
CoRR
(2020)
Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
TTS-by-TTS: TTS-driven Data Augmentation for Fast and High-Quality Speech Synthesis.
CoRR
(2020)
Ohsung Kwon, Eunwoo Song, Jae-Min Kim, Hong-Goo Kang
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems.
CoRR
(2019)
Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram.
CoRR
(2019)
Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation.
CoRR
(2019)
Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation.
INTERSPEECH
(2019)
Kyung-Joon Shin, Seong-Cheol Lee, Yun Yong Kim, Jae-Min Kim, Seunghee Park, Hwanwoo Lee
Construction Condition and Damage Monitoring of Post-Tensioned PSC Girders Using Embedded Sensors.
Sensors
17 (8) (2017)
Jae-Min Kim, Chul-Min Kim, Song-yi Choi, Bang Yeon Lee
Enhanced Strain Measurement Range of an FBG Sensor Embedded in Seven-Wire Steel Strands.
Sensors
17 (7) (2017)