Thilo Köhler
Publication Activity (10 Years)
Years Active: 2005-2024
Publications (10 Years): 13
Top Topics
Prosodic Features
Speech Synthesis
Bird Species
High Quality
Top Venues
CoRR
ICASSP
INTERSPEECH
SSW
Publications
Prabhav Agrawal, Thilo Köhler, Zhiping Xiu, Prashant Serai, Qing He
Ultra-Lightweight Neural Differential DSP Vocoder for High Quality Speech Synthesis.
ICASSP
(2024)
Prabhav Agrawal, Thilo Köhler, Zhiping Xiu, Prashant Serai, Qing He
Ultra-lightweight Neural Differential DSP Vocoder For High Quality Speech Synthesis.
CoRR
(2024)
Tejas Jayashankar, Thilo Köhler, Kaustubh Kalgaonkar, Zhiping Xiu, Jilong Wu, Ju Lin, Prabhav Agrawal, Qing He
Architecture for Variable Bitrate Neural Speech Codec with Configurable Computation Complexity.
ICASSP
(2022)
Jason Fong, Yun Wang, Prabhav Agrawal, Vimal Manohar, Jilong Wu, Thilo Köhler, Qing He
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders.
CoRR
(2022)
Chunyang Wu, Zhiping Xiu, Yangyang Shi, Ozlem Kalinli, Christian Fuegen, Thilo Köhler, Qing He
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis.
INTERSPEECH
(2021)
Qing He, Zhiping Xiu, Thilo Köhler, Jilong Wu
Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling.
CoRR
(2021)
Jason Fong, Jilong Wu, Prabhav Agrawal, Andrew Gibiansky, Thilo Köhler, Qing He
Improving Polyglot Speech Synthesis through Multi-task and Adversarial Learning.
SSW
(2021)
Qing He, Zhiping Xiu, Thilo Köhler, Jilong Wu
Multi-Rate Attention Architecture for Fast Streamable Text-to-Speech Spectrum Modeling.
ICASSP
(2021)
Yang Gao, Weiyi Zheng, Zhaojun Yang, Thilo Köhler, Christian Fuegen, Qing He
Interactive Text-to-Speech via Semi-supervised Style Transfer Learning.
CoRR
(2020)
Duc Le, Thilo Köhler, Christian Fuegen, Michael L. Seltzer
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR.
ICASSP
(2020)
Bichen Wu, Qing He, Peizhao Zhang, Thilo Köhler, Kurt Keutzer, Peter Vajda
FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge.
CoRR
(2020)
Yang Gao, Weiyi Zheng, Zhaojun Yang, Thilo Köhler, Christian Fuegen, Qing He
Interactive Text-to-Speech System via Joint Style Analysis.
INTERSPEECH
(2020)
Duc Le, Thilo Köhler, Christian Fuegen, Michael L. Seltzer
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR.
CoRR
(2019)
Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, Stephan Vogel, Alan W. Black, Tanja Schultz, Alex Waibel
Optimizing components for handheld two-way speech translation for an English-Iraqi Arabic system.
INTERSPEECH
(2006)
Maria Danninger, G. Flaherty, Keni Bernardin, Hazim Kemal Ekenel, Thilo Köhler, Robert G. Malkin, Rainer Stiefelhagen, Alex Waibel
The connector: facilitating context-aware communication.
ICMI
(2005)
Thilo Köhler, Christian Fügen, Sebastian Stüker, Alex Waibel
Rapid porting of ASR-systems to mobile devices.
INTERSPEECH
(2005)