​
Login / Signup
Eunwoo Song
Publication Activity (10 Years)
Years Active: 2013-2024
Publications (10 Years): 44
Top Topics
Speech Synthesis
Language Model
Neural Network
Lightweight
Top Venues
CoRR
INTERSPEECH
ICASSP
ICEIC
</>
Publications
</>
Hyun-Wook Yoon
,
Jin-Seob Kim
,
Ryuichi Yamamoto
,
Ryo Terashima
,
Chan-Ho Song
,
Jae-Min Kim
,
Eunwoo Song
Enhancing Multilingual TTS with Voice Conversion Based Data Augmentation and Posterior Embedding.
ICASSP
(2024)
Heeseung Kim
,
Soonshin Seo
,
Kyeongseok Jeong
,
Ohsung Kwon
,
Jungwhan Kim
,
Jaehong Lee
,
Eunwoo Song
,
Myungwoo Oh
,
Sungroh Yoon
,
Kang Min Yoo
Unified Speech-Text Pretraining for Spoken Dialog Modeling.
CoRR
(2024)
Hyungchan Yoon
,
Changhwan Kim
,
Eunwoo Song
,
Hyun-Wook Yoon
,
Hong-Goo Kang
Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech.
CoRR
(2023)
Hyungchan Yoon
,
Changhwan Kim
,
Eunwoo Song
,
Hyun-Wook Yoon
,
Hong-Goo Kang
Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech.
INTERSPEECH
(2023)
Yuma Shirahata
,
Ryuichi Yamamoto
,
Eunwoo Song
,
Ryo Terashima
,
Jae-Min Kim
,
Kentaro Tachibana
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis.
ICASSP
(2023)
Ryo Terashima
,
Ryuichi Yamamoto
,
Eunwoo Song
,
Yuma Shirahata
,
Hyun-Wook Yoon
,
Jae-Min Kim
,
Kentaro Tachibana
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.
CoRR
(2022)
Hyun-Wook Yoon
,
Ohsung Kwon
,
Hoyeon Lee
,
Ryuichi Yamamoto
,
Eunwoo Song
,
Jae-Min Kim
,
Min-Jae Hwang
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems.
INTERSPEECH
(2022)
Hyun-Wook Yoon
,
Ohsung Kwon
,
Hoyeon Lee
,
Ryuichi Yamamoto
,
Eunwoo Song
,
Jae-Min Kim
,
Min-Jae Hwang
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems.
CoRR
(2022)
Sang-Hoon Lee
,
Seung-Bin Kim
,
Ji-Hyun Lee
,
Eunwoo Song
,
Min-Jae Hwang
,
Seong-Whan Lee
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis.
NeurIPS
(2022)
Min-Jae Hwang
,
Hyun-Wook Yoon
,
Chan-Ho Song
,
Jin-Seob Kim
,
Jae-Min Kim
,
Eunwoo Song
Linear Prediction-based Parallel WaveGAN Speech Synthesis.
ICEIC
(2022)
Yuma Shirahata
,
Ryuichi Yamamoto
,
Eunwoo Song
,
Ryo Terashima
,
Jae-Min Kim
,
Kentaro Tachibana
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis.
CoRR
(2022)
Ryo Terashima
,
Ryuichi Yamamoto
,
Eunwoo Song
,
Yuma Shirahata
,
Hyun-Wook Yoon
,
Jae-Min Kim
,
Kentaro Tachibana
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.
INTERSPEECH
(2022)
Suhyeon Oh
,
Ohsung Kwon
,
Min-Jae Hwang
,
Jae-Min Kim
,
Eunwoo Song
Effective Data Augmentation Methods for Neural Text-to-Speech Systems.
ICEIC
(2022)
Eunwoo Song
,
Ryuichi Yamamoto
,
Ohsung Kwon
,
Chan-Ho Song
,
Min-Jae Hwang
,
Suhyeon Oh
,
Hyun-Wook Yoon
,
Jin-Seob Kim
,
Jae-Min Kim
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder.
CoRR
(2022)
Eunwoo Song
,
Ryuichi Yamamoto
,
Ohsung Kwon
,
Chan-Ho Song
,
Min-Jae Hwang
,
Suhyeon Oh
,
Hyun-Wook Yoon
,
Jin-Seob Kim
,
Jae-Min Kim
TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder.
INTERSPEECH
(2022)
Huu-Kim Nguyen
,
Kihyuk Jeong
,
Se-Yun Um
,
Min-Jae Hwang
,
Eunwoo Song
,
Hong-Goo Kang
LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks.
Interspeech
(2021)
Min-Jae Hwang
,
Ryuichi Yamamoto
,
Eunwoo Song
,
Jae-Min Kim
TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis.
ICASSP
(2021)
Eunwoo Song
,
Ryuichi Yamamoto
,
Min-Jae Hwang
,
Jin-Seob Kim
,
Ohsung Kwon
,
Jae-Min Kim
Improved Parallel Wavegan Vocoder with Perceptually Weighted Spectrogram Loss.
SLT
(2021)
Eunwoo Song
,
Ryuichi Yamamoto
,
Min-Jae Hwang
,
Jin-Seob Kim
,
Ohsung Kwon
,
Jae-Min Kim
Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss.
CoRR
(2021)
Ryuichi Yamamoto
,
Eunwoo Song
,
Min-Jae Hwang
,
Jae-Min Kim
Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators.
ICASSP
(2021)
Min-Jae Hwang
,
Ryuichi Yamamoto
,
Eunwoo Song
,
Jae-Min Kim
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model.
Interspeech
(2021)
Min-Jae Hwang
,
Eunwoo Song
,
Ryuichi Yamamoto
,
Frank K. Soong
,
Hong-Goo Kang
Improving LPCNET-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network.
ICASSP
(2020)
Eunwoo Song
,
Jin-Seob Kim
,
Kyungguen Byun
,
Hong-Goo Kang
Speaker-Adaptive Neural Vocoders for Parametric Speech Synthesis Systems.
MMSP
(2020)
Min-Jae Hwang
,
Frank K. Soong
,
Eunwoo Song
,
Xi Wang
,
Hyeonjoo Kang
,
Hong-Goo Kang
LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis.
APSIPA
(2020)
Ryuichi Yamamoto
,
Eunwoo Song
,
Jae-Min Kim
Parallel Wavegan: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram.
ICASSP
(2020)
Eunwoo Song
,
Min-Jae Hwang
,
Ryuichi Yamamoto
,
Jin-Seob Kim
,
Ohsung Kwon
,
Jae-Min Kim
Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder.
INTERSPEECH
(2020)
Ryuichi Yamamoto
,
Eunwoo Song
,
Min-Jae Hwang
,
Jae-Min Kim
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators.
CoRR
(2020)
Min-Jae Hwang
,
Ryuichi Yamamoto
,
Eunwoo Song
,
Jae-Min Kim
TTS-by-TTS: TTS-driven Data Augmentation for Fast and High-Quality Speech Synthesis.
CoRR
(2020)
Suhyeon Oh
,
Hyungseob Lim
,
Kyungguen Byun
,
Min-Jae Hwang
,
Eunwoo Song
,
Hong-Goo Kang
ExcitGlow: Improving a WaveGlow-based Neural Vocoder with Linear Prediction Analysis.
APSIPA
(2020)
Ohsung Kwon
,
Eunwoo Song
,
Jae-Min Kim
,
Hong-Goo Kang
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems.
CoRR
(2019)
Ryuichi Yamamoto
,
Eunwoo Song
,
Jae-Min Kim
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram.
CoRR
(2019)
Ryuichi Yamamoto
,
Eunwoo Song
,
Jae-Min Kim
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation.
CoRR
(2019)
Eunwoo Song
,
Kyungguen Byun
,
Hong-Goo Kang
ExcitNet Vocoder: A Neural Excitation Model for Parametric Speech Synthesis Systems.
EUSIPCO
(2019)
Ryuichi Yamamoto
,
Eunwoo Song
,
Jae-Min Kim
Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation.
INTERSPEECH
(2019)
Eunwoo Song
,
Jin-Seob Kim
,
Kyungguen Byun
,
Hong-Goo Kang
Speaker-adaptive neural vocoders for statistical parametric speech synthesis systems.
CoRR
(2018)
Joun Yeop Lee
,
Sung Jun Cheon
,
Byoung Jin Choi
,
Nam Soo Kim
,
Eunwoo Song
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis.
INTERSPEECH
(2018)
Min-Jae Hwang
,
Eunwoo Song
,
Jin-Seob Kim
,
Hong-Goo Kang
A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems.
INTERSPEECH
(2018)
Min-Jae Hwang
,
Eunwoo Song
,
Kyungguen Byun
,
Hong-Goo Kang
Modeling-By-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System.
ICASSP
(2018)
Eunwoo Song
,
Kyungguen Byun
,
Hong-Goo Kang
ExcitNet vocoder: A neural excitation model for parametric speech synthesis systems.
CoRR
(2018)
Eunwoo Song
,
Frank K. Soong
,
Hong-Goo Kang
Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems.
ASRU
(2017)
Eunwoo Song
,
Frank K. Soong
,
Hong-Goo Kang
Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems.
IEEE ACM Trans. Audio Speech Lang. Process.
25 (11) (2017)
Eunwoo Song
,
Hong-Goo Kang
Multi-class learning algorithm for deep neural network-based statistical parametric speech synthesis.
EUSIPCO
(2016)
Eunwoo Song
,
Frank K. Soong
,
Hong-Goo Kang
Improved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis.
INTERSPEECH
(2016)
Jongeun Koo
,
Eunwoo Song
,
Eunhyeok Park
,
Dongyoung Kim
,
Junki Park
,
Sungju Ryu
,
Sungjoo Yoo
,
Jae-Joon Kim
Area-efficient one-cycle correction scheme for timing errors in flip-flop based pipelines.
A-SSCC
(2016)
Eunwoo Song
,
Hong-Goo Kang
Deep neural network-based statistical parametric speech synthesis system using improved time-frequency trajectory excitation model.
INTERSPEECH
(2015)
Eunwoo Song
,
Young-Sun Joo
,
Hong-Goo Kang
Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system.
ICASSP
(2015)
Kyungguen Byun
,
Eunwoo Song
,
Hwan Shim
,
Hyungjoon Lim
,
Hong-Goo Kang
A constrained two-layer compression technique for ECG waves.
EMBC
(2015)
Eunwoo Song
,
Hong-Goo Kang
,
Joonil Lee
Fixed-point implementation of MPEG-D unified speech and audio coding decoder.
DSP
(2014)
Eunwoo Song
,
Jongyoub Ryu
,
Hong-Goo Kang
Speech enhancement for pathological voice using time-frequency trajectory excitation modeling.
APSIPA
(2013)