Jan Trmal

Publication Activity (10 Years)

Years Active: 2006-2024
Publications (10 Years): 32

Top Topics

Speech Recognition

Top Venues

Publications

Ruizhe Huang, Mahsa Yarmohammadi, Jan Trmal, Jing Liu, Desh Raj, Leibny Paola García, Alexei V. Ivanov, Patrick Ehlen, Mingzhi Yu, Dan Povey, Sanjeev Khudanpur
ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition. LREC/COLING (2024)
Ruizhe Huang, Matthew Wiesner, Leibny Paola García-Perera, Daniel Povey, Jan Trmal, Sanjeev Khudanpur
Building Keyword Search System from End-To-End Asr Systems. ICASSP (2023)
Guoguo Chen, Shuzhou Chai, Guan-Bo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Zhao You, Zhiyong Yan
GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10, 000 Hours of Transcribed Audio. Interspeech (2021)
Guoguo Chen, Shuzhou Chai, Guanbo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Yujun Wang, Zhao You, Zhiyong Yan
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10, 000 Hours of Transcribed Audio. CoRR (2021)
Matthew Wiesner, Mousmita Sarma, Ashish Arora, Desh Raj, Dongji Gao, Ruizhe Huang, Supreet Preet, Moris Johnson, Zikra Iqbal, Nagendra Goel, Jan Trmal, Leibny Paola García-Perera, Sanjeev Khudanpur
Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition. Interspeech (2021)
Piotr Zelasko, Sonal Joshi, Yiwen Shao, Jesús Villalba, Jan Trmal, Najim Dehak, Sanjeev Khudanpur
Adversarial Attacks and Defenses for Speech Recognition Systems. CoRR (2021)
Oliver Adams, Matthew Wiesner, Jan Trmal, Garrett Nicolai, David Yarowsky
Induced Inflection-Set Keyword Search in Speech. SIGMORPHON (2020)
Mirco Ravanelli, Jianyuan Zhong, Santiago Pascual, Pawel Swietojanski, Joao Monteiro, Jan Trmal, Yoshua Bengio
Multi-task self-supervised learning for Robust Speech Recognition. CoRR (2020)
Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas
DiPCo - Dinner Party Corpus. INTERSPEECH (2020)
Mirco Ravanelli, Jianyuan Zhong, Santiago Pascual, Pawel Swietojanski, Joao Monteiro, Jan Trmal, Yoshua Bengio
Multi-Task Self-Supervised Learning for Robust Speech Recognition. ICASSP (2020)
Saurabhchand Bhati, Chunxi Liu, Jesús Villalba, Jan Trmal, Sanjeev Khudanpur, Najim Dehak
Bottom-Up Unsupervised Word Discovery via Acoustic Units. GlobalSIP (2019)
Matthew Wiesner, Oliver Adams, David Yarowsky, Jan Trmal, Sanjeev Khudanpur
Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer. ASRU (2019)
Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas
DiPCo - Dinner Party Corpus. CoRR (2019)
Oliver Adams, Matthew Wiesner, Jan Trmal, Garrett Nicolai, David Yarowsky
Induced Inflection-Set Keyword Search in Speech. CoRR (2019)
Ashish Arora, Paola García, Shinji Watanabe, Vimal Manohar, Yiwen Shao, Sanjeev Khudanpur, Chun-Chieh Chang, Babak Rekabdar, Bagher BabaAli, Daniel Povey, David Etter, Desh Raj, Hossein Hadian, Jan Trmal
Using ASR Methods for OCR. ICDAR (2019)
Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Najim Dehak, Sanjeev Khudanpur
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages. INTERSPEECH (2018)
Jan Svec, Josef V. Psutka, Jan Trmal, Lubas Smfdl, Pavel Ircing, Jan Sedmidubský
On the Use of Grapheme Models for Searching in Large Spoken Archives. ICASSP (2018)
Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur
Low-Resource Contextual Topic Identification on Speech. CoRR (2018)
Jon Barker, Shinji Watanabe, Emmanuel Vincent, Jan Trmal
The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines. INTERSPEECH (2018)
Lubos Smídl, Jan Svec, Ales Prazák, Jan Trmal
Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition. SPECOM (2018)
Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur
Low-Resource Contextual Topic Identification on Speech. SLT (2018)
Jon Barker, Shinji Watanabe, Emmanuel Vincent, Jan Trmal
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines. CoRR (2018)
Fred Richardson, Pedro A. Torres-Carrasquillo, Jonas Borgstrom, Douglas E. Sturim, Youngjune Gwon, Jesús Villalba, Jan Trmal, Nanxin Chen, Réda Dehak, Najim Dehak
The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System. Odyssey (2018)
Hossein Hadian, Daniel Povey, Hossein Sameti, Jan Trmal, Sanjeev Khudanpur
Improving LF-MMI Using Unconstrained Supervisions for ASR. SLT (2018)
Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Sanjeev Khudanpur, Najim Dehak
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection. CoRR (2018)
Jan Svec, Josef V. Psutka, Lubos Smídl, Jan Trmal
A Relevance Score Estimation for Spoken Term Detection Based on RNN-Generated Pronunciation Embeddings. INTERSPEECH (2017)
Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur
Topic Identification for Speech without ASR. CoRR (2017)
Mirko Hannemann, Jan Trmal, Lucas Ondel, Santosh Kesiraju, Lukás Burget
Bayesian joint-sequence models for grapheme-to-phoneme conversion. ICASSP (2017)
Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur
Topic Identification for Speech Without ASR. INTERSPEECH (2017)
Jan Trmal, Matthew Wiesner, Vijayaditya Peddinti, Xiaohui Zhang, Pegah Ghahremani, Yiming Wang, Vimal Manohar, Hainan Xu, Daniel Povey, Sanjeev Khudanpur
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. INTERSPEECH (2017)
Jan Trmal, Gaurav Kumar, Vimal Manohar, Sanjeev Khudanpur, Matt Post, Paul McNamee
Using of heterogeneous corpora for training of an ASR system. CoRR (2017)
Eleanor Chodroff, Matthew Maciejewski, Jan Trmal, Sanjeev Khudanpur, John Godfrey
New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification. LREC (2016)
Gaurav Kumar, Graeme W. Blackwood, Jan Trmal, Daniel Povey, Sanjeev Khudanpur
A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation. EMNLP (2015)
Chunxi Liu, Aren Jansen, Guoguo Chen, Keith Kintzley, Jan Trmal, Sanjeev Khudanpur
Low-resource open vocabulary keyword search using point process models. INTERSPEECH (2014)
Justin T. Chiu, Yun Wang, Jan Trmal, Daniel Povey, Guoguo Chen, Alexander I. Rudnicky
Combination of FST and CN search in spoken term detection. INTERSPEECH (2014)
Pegah Ghahremani, Bagher BabaAli, Daniel Povey, Korbinian Riedhammer, Jan Trmal, Sanjeev Khudanpur
A pitch extraction algorithm tuned for automatic speech recognition. ICASSP (2014)
Xiaohui Zhang, Jan Trmal, Daniel Povey, Sanjeev Khudanpur
Improving deep neural network acoustic models using generalized maxout networks. ICASSP (2014)
Jan Trmal, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur, Pegah Ghahremani, Xiaohui Zhang, Vimal Manohar, Chunxi Liu, Aren Jansen, Dietrich Klakow, David Yarowsky, Florian Metze
A keyword search system using open source software. SLT (2014)
Guoguo Chen, Sanjeev Khudanpur, Daniel Povey, Jan Trmal, David Yarowsky, Oguz Yilmaz
Quantifying the value of pronunciation lexicons for keyword search in lowresource languages. ICASSP (2013)
Guoguo Chen, Oguz Yilmaz, Jan Trmal, Daniel Povey, Sanjeev Khudanpur
Using proxies for OOV keywords in the keyword search task. ASRU (2013)
Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka
Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors. IEEE Trans. Speech Audio Process. 20 (6) (2012)
Ales Prazák, Zdenek Loose, Jan Trmal, Josef V. Psutka, Josef Psutka
Captioning of Live TV Programs through Speech Recognition and Re-speaking. TSD (2012)
Ales Prazák, Zdenek Loose, Jan Trmal, Josef V. Psutka, Josef Psutka
Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs. INTERSPEECH (2012)
Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka
Full covariance Gaussian mixture models evaluation on GPU. ISSPIT (2012)
Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka
Optimization of the Gaussian Mixture Model Evaluation on GPU. INTERSPEECH (2011)
Jan Zelinka, Jan Trmal, Ludek Müller
Low-dimensional space transforms of posteriors in speech recognition. INTERSPEECH (2010)
Jan Trmal, Ales Prazák, Zdenek Loose, Josef Psutka
Online TV Captioning of Czech Parliamentary Sessions. TSD (2010)
Jan Trmal, Jan Zelinka, Ludek Müller
Adaptation of a Feedforward Artificial Neural Network Using a Linear Transform. TSD (2010)
Jan Trmal, Jan Zelinka, Ludek Müller
On speaker adaptive training of artificial neural networks. INTERSPEECH (2010)
Jan Zelinka, Lubos Smídl, Jan Trmal, Ludek Müller
Posterior Estimates and Transforms for Speech Recognition. TSD (2010)
Jindrich Matousek, Radek Skarnitzl, Pavel Machac, Jan Trmal
Identification and automatic detection of parasitic speech sounds. INTERSPEECH (2009)
Jan Trmal, Marek Hrúz, Jan Zelinka, Pavel Campr, Ludek Müller
Feature space transforms for Czech sign-language recognition. INTERSPEECH (2008)
Miroslav Nagy, Petr Hanzlícek, Jana Zvárová, Tatjana Dostálová, Michaela Seydlova, Radim Hippman, Lubos Smídl, Jan Trmal, Josef Psutka
Voice-controlled Data Entry in Dental Electronic Health Record. MIE (2008)
Jan Trmal, Jan Zelinka, Jan Vanek, Ludek Müller
Silence/Speech Detection Method Based on Set of Decision Graphs. TSD (2006)
Jan Trmal, Jan Vanek, Ludek Müller, Jan Zelinka
Independent components for acoustic modeling. INTERSPEECH (2006)