​
Login / Signup
Seung-Bin Kim
ORCID
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 7
Top Topics
Variational Inference
Prosodic Features
Latent Dirichlet Allocation
Speech Synthesis
Top Venues
CoRR
ICASSP
IEEE ACM Trans. Audio Speech Lang. Process.
NeurIPS
</>
Publications
</>
Seung-Bin Kim
,
Sang-Hoon Lee
,
Seong-Whan Lee
TranSentence: speech-to-speech Translation via Language-Agnostic Sentence-Level Speech Encoding without Language-Parallel Data.
ICASSP
(2024)
Seung-Bin Kim
,
Sang-Hoon Lee
,
Ha-Yeong Choi
,
Seong-Whan Lee
Audio Super-Resolution With Robust Speech Representation Learning of Masked Autoencoder.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Seung-Bin Kim
,
Sang-Hoon Lee
,
Seong-Whan Lee
TranSentence: Speech-to-speech Translation via Language-agnostic Sentence-level Speech Encoding without Language-parallel Data.
CoRR
(2024)
Deok-Hyeon Cho
,
Hyung-Seok Oh
,
Seung-Bin Kim
,
Sang-Hoon Lee
,
Seong-Whan Lee
EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech.
CoRR
(2024)
Sang-Hoon Lee
,
Ha-Yeong Choi
,
Seung-Bin Kim
,
Seong-Whan Lee
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis.
CoRR
(2023)
Sang-Hoon Lee
,
Seung-Bin Kim
,
Ji-Hyun Lee
,
Eunwoo Song
,
Min-Jae Hwang
,
Seong-Whan Lee
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis.
NeurIPS
(2022)
Chae-Bin Im
,
Sang-Hoon Lee
,
Seung-Bin Kim
,
Seong-Whan Lee
EMOQ-TTS: Emotion Intensity Quantization for Fine-Grained Controllable Emotional Text-to-Speech.
ICASSP
(2022)