Eunwoo Song

Publication Activity (10 Years)

Years Active: 2013-2024
Publications (10 Years): 44

Top Topics

Speech Synthesis

Top Venues

Publications

Hyun-Wook Yoon, Jin-Seob Kim, Ryuichi Yamamoto, Ryo Terashima, Chan-Ho Song, Jae-Min Kim, Eunwoo Song
Enhancing Multilingual TTS with Voice Conversion Based Data Augmentation and Posterior Embedding. ICASSP (2024)
Heeseung Kim, Soonshin Seo, Kyeongseok Jeong, Ohsung Kwon, Jungwhan Kim, Jaehong Lee, Eunwoo Song, Myungwoo Oh, Sungroh Yoon, Kang Min Yoo
Unified Speech-Text Pretraining for Spoken Dialog Modeling. CoRR (2024)
Hyungchan Yoon, Changhwan Kim, Eunwoo Song, Hyun-Wook Yoon, Hong-Goo Kang
Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech. CoRR (2023)
Hyungchan Yoon, Changhwan Kim, Eunwoo Song, Hyun-Wook Yoon, Hong-Goo Kang
Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech. INTERSPEECH (2023)
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis. ICASSP (2023)
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. CoRR (2022)
Hyun-Wook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems. INTERSPEECH (2022)
Hyun-Wook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems. CoRR (2022)
Sang-Hoon Lee, Seung-Bin Kim, Ji-Hyun Lee, Eunwoo Song, Min-Jae Hwang, Seong-Whan Lee
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis. NeurIPS (2022)
Min-Jae Hwang, Hyun-Wook Yoon, Chan-Ho Song, Jin-Seob Kim, Jae-Min Kim, Eunwoo Song
Linear Prediction-based Parallel WaveGAN Speech Synthesis. ICEIC (2022)
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis. CoRR (2022)
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. INTERSPEECH (2022)
Suhyeon Oh, Ohsung Kwon, Min-Jae Hwang, Jae-Min Kim, Eunwoo Song
Effective Data Augmentation Methods for Neural Text-to-Speech Systems. ICEIC (2022)
Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder. CoRR (2022)
Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim
TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder. INTERSPEECH (2022)
Huu-Kim Nguyen, Kihyuk Jeong, Se-Yun Um, Min-Jae Hwang, Eunwoo Song, Hong-Goo Kang
LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks. Interspeech (2021)
Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis. ICASSP (2021)
Eunwoo Song, Ryuichi Yamamoto, Min-Jae Hwang, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim
Improved Parallel Wavegan Vocoder with Perceptually Weighted Spectrogram Loss. SLT (2021)
Eunwoo Song, Ryuichi Yamamoto, Min-Jae Hwang, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim
Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss. CoRR (2021)
Ryuichi Yamamoto, Eunwoo Song, Min-Jae Hwang, Jae-Min Kim
Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators. ICASSP (2021)
Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model. Interspeech (2021)
Min-Jae Hwang, Eunwoo Song, Ryuichi Yamamoto, Frank K. Soong, Hong-Goo Kang
Improving LPCNET-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network. ICASSP (2020)
Eunwoo Song, Jin-Seob Kim, Kyungguen Byun, Hong-Goo Kang
Speaker-Adaptive Neural Vocoders for Parametric Speech Synthesis Systems. MMSP (2020)
Min-Jae Hwang, Frank K. Soong, Eunwoo Song, Xi Wang, Hyeonjoo Kang, Hong-Goo Kang
LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis. APSIPA (2020)
Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
Parallel Wavegan: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram. ICASSP (2020)
Eunwoo Song, Min-Jae Hwang, Ryuichi Yamamoto, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim
Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder. INTERSPEECH (2020)
Ryuichi Yamamoto, Eunwoo Song, Min-Jae Hwang, Jae-Min Kim
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators. CoRR (2020)
Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
TTS-by-TTS: TTS-driven Data Augmentation for Fast and High-Quality Speech Synthesis. CoRR (2020)
Suhyeon Oh, Hyungseob Lim, Kyungguen Byun, Min-Jae Hwang, Eunwoo Song, Hong-Goo Kang
ExcitGlow: Improving a WaveGlow-based Neural Vocoder with Linear Prediction Analysis. APSIPA (2020)
Ohsung Kwon, Eunwoo Song, Jae-Min Kim, Hong-Goo Kang
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems. CoRR (2019)
Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram. CoRR (2019)
Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation. CoRR (2019)
Eunwoo Song, Kyungguen Byun, Hong-Goo Kang
ExcitNet Vocoder: A Neural Excitation Model for Parametric Speech Synthesis Systems. EUSIPCO (2019)
Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim
Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation. INTERSPEECH (2019)
Eunwoo Song, Jin-Seob Kim, Kyungguen Byun, Hong-Goo Kang
Speaker-adaptive neural vocoders for statistical parametric speech synthesis systems. CoRR (2018)
Joun Yeop Lee, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim, Eunwoo Song
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis. INTERSPEECH (2018)
Min-Jae Hwang, Eunwoo Song, Jin-Seob Kim, Hong-Goo Kang
A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems. INTERSPEECH (2018)
Min-Jae Hwang, Eunwoo Song, Kyungguen Byun, Hong-Goo Kang
Modeling-By-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System. ICASSP (2018)
Eunwoo Song, Kyungguen Byun, Hong-Goo Kang
ExcitNet vocoder: A neural excitation model for parametric speech synthesis systems. CoRR (2018)
Eunwoo Song, Frank K. Soong, Hong-Goo Kang
Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems. ASRU (2017)
Eunwoo Song, Frank K. Soong, Hong-Goo Kang
Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems. IEEE ACM Trans. Audio Speech Lang. Process. 25 (11) (2017)
Eunwoo Song, Hong-Goo Kang
Multi-class learning algorithm for deep neural network-based statistical parametric speech synthesis. EUSIPCO (2016)
Eunwoo Song, Frank K. Soong, Hong-Goo Kang
Improved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis. INTERSPEECH (2016)
Jongeun Koo, Eunwoo Song, Eunhyeok Park, Dongyoung Kim, Junki Park, Sungju Ryu, Sungjoo Yoo, Jae-Joon Kim
Area-efficient one-cycle correction scheme for timing errors in flip-flop based pipelines. A-SSCC (2016)
Eunwoo Song, Hong-Goo Kang
Deep neural network-based statistical parametric speech synthesis system using improved time-frequency trajectory excitation model. INTERSPEECH (2015)
Eunwoo Song, Young-Sun Joo, Hong-Goo Kang
Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system. ICASSP (2015)
Kyungguen Byun, Eunwoo Song, Hwan Shim, Hyungjoon Lim, Hong-Goo Kang
A constrained two-layer compression technique for ECG waves. EMBC (2015)
Eunwoo Song, Hong-Goo Kang, Joonil Lee
Fixed-point implementation of MPEG-D unified speech and audio coding decoder. DSP (2014)
Eunwoo Song, Jongyoub Ryu, Hong-Goo Kang
Speech enhancement for pathological voice using time-frequency trajectory excitation modeling. APSIPA (2013)