Thilo Köhler

Publication Activity (10 Years)

Years Active: 2005-2024
Publications (10 Years): 13

Top Topics

Prosodic Features

Speech Synthesis

Top Venues

Publications

Prabhav Agrawal, Thilo Köhler, Zhiping Xiu, Prashant Serai, Qing He
Ultra-Lightweight Neural Differential DSP Vocoder for High Quality Speech Synthesis. ICASSP (2024)
Prabhav Agrawal, Thilo Köhler, Zhiping Xiu, Prashant Serai, Qing He
Ultra-lightweight Neural Differential DSP Vocoder For High Quality Speech Synthesis. CoRR (2024)
Tejas Jayashankar, Thilo Köhler, Kaustubh Kalgaonkar, Zhiping Xiu, Jilong Wu, Ju Lin, Prabhav Agrawal, Qing He
Architecture for Variable Bitrate Neural Speech Codec with Configurable Computation Complexity. ICASSP (2022)
Jason Fong, Yun Wang, Prabhav Agrawal, Vimal Manohar, Jilong Wu, Thilo Köhler, Qing He
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders. CoRR (2022)
Chunyang Wu, Zhiping Xiu, Yangyang Shi, Ozlem Kalinli, Christian Fuegen, Thilo Köhler, Qing He
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis. Interspeech (2021)
Qing He, Zhiping Xiu, Thilo Köhler, Jilong Wu
Multi-rate attention architecture for fast streamable Text-to-speech spectrum modeling. CoRR (2021)
Jason Fong, Jilong Wu, Prabhav Agrawal, Andrew Gibiansky, Thilo Köhler, Qing He
Improving Polyglot Speech Synthesis through Multi-task and Adversarial Learning. SSW (2021)
Qing He, Zhiping Xiu, Thilo Köhler, Jilong Wu
Multi-Rate Attention Architecture for Fast Streamable Text-to-Speech Spectrum Modeling. ICASSP (2021)
Yang Gao, Weiyi Zheng, Zhaojun Yang, Thilo Köhler, Christian Fuegen, Qing He
Interactive Text-to-Speech via Semi-supervised Style Transfer Learning. CoRR (2020)
Duc Le, Thilo Köhler, Christian Fuegen, Michael L. Seltzer
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR. ICASSP (2020)
Bichen Wu, Qing He, Peizhao Zhang, Thilo Köhler, Kurt Keutzer, Peter Vajda
FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge. CoRR (2020)
Yang Gao, Weiyi Zheng, Zhaojun Yang, Thilo Köhler, Christian Fuegen, Qing He
Interactive Text-to-Speech System via Joint Style Analysis. INTERSPEECH (2020)
Duc Le, Thilo Köhler, Christian Fuegen, Michael L. Seltzer
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR. CoRR (2019)
Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, Stephan Vogel, Alan W. Black, Tanja Schultz, Alex Waibel
Optimizing components for handheld two-way speech translation for an English-iraqi Arabic system. INTERSPEECH (2006)
Maria Danninger, G. Flaherty, Keni Bernardin, Hazim Kemal Ekenel, Thilo Köhler, Robert G. Malkin, Rainer Stiefelhagen, Alex Waibel
The connector: facilitating context-aware communication. ICMI (2005)
Thilo Köhler, Christian Fügen, Sebastian Stüker, Alex Waibel
Rapid porting of ASR-systems to mobile devices. INTERSPEECH (2005)