Login / Signup
Jan Trmal
Publication Activity (10 Years)
Years Active: 2006-2024
Publications (10 Years): 32
Top Topics
Keyword Search
Hybrid Models
Open Source
Speech Recognition
Top Venues
CoRR
INTERSPEECH
ICASSP
SLT
</>
Publications
</>
Ruizhe Huang
,
Mahsa Yarmohammadi
,
Jan Trmal
,
Jing Liu
,
Desh Raj
,
Leibny Paola García
,
Alexei V. Ivanov
,
Patrick Ehlen
,
Mingzhi Yu
,
Dan Povey
,
Sanjeev Khudanpur
ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition.
LREC/COLING
(2024)
Ruizhe Huang
,
Matthew Wiesner
,
Leibny Paola García-Perera
,
Daniel Povey
,
Jan Trmal
,
Sanjeev Khudanpur
Building Keyword Search System from End-To-End Asr Systems.
ICASSP
(2023)
Guoguo Chen
,
Shuzhou Chai
,
Guan-Bo Wang
,
Jiayu Du
,
Wei-Qiang Zhang
,
Chao Weng
,
Dan Su
,
Daniel Povey
,
Jan Trmal
,
Junbo Zhang
,
Mingjie Jin
,
Sanjeev Khudanpur
,
Shinji Watanabe
,
Shuaijiang Zhao
,
Wei Zou
,
Xiangang Li
,
Xuchen Yao
,
Yongqing Wang
,
Zhao You
,
Zhiyong Yan
GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10, 000 Hours of Transcribed Audio.
Interspeech
(2021)
Guoguo Chen
,
Shuzhou Chai
,
Guanbo Wang
,
Jiayu Du
,
Wei-Qiang Zhang
,
Chao Weng
,
Dan Su
,
Daniel Povey
,
Jan Trmal
,
Junbo Zhang
,
Mingjie Jin
,
Sanjeev Khudanpur
,
Shinji Watanabe
,
Shuaijiang Zhao
,
Wei Zou
,
Xiangang Li
,
Xuchen Yao
,
Yongqing Wang
,
Yujun Wang
,
Zhao You
,
Zhiyong Yan
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10, 000 Hours of Transcribed Audio.
CoRR
(2021)
Matthew Wiesner
,
Mousmita Sarma
,
Ashish Arora
,
Desh Raj
,
Dongji Gao
,
Ruizhe Huang
,
Supreet Preet
,
Moris Johnson
,
Zikra Iqbal
,
Nagendra Goel
,
Jan Trmal
,
Leibny Paola García-Perera
,
Sanjeev Khudanpur
Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition.
Interspeech
(2021)
Piotr Zelasko
,
Sonal Joshi
,
Yiwen Shao
,
Jesús Villalba
,
Jan Trmal
,
Najim Dehak
,
Sanjeev Khudanpur
Adversarial Attacks and Defenses for Speech Recognition Systems.
CoRR
(2021)
Oliver Adams
,
Matthew Wiesner
,
Jan Trmal
,
Garrett Nicolai
,
David Yarowsky
Induced Inflection-Set Keyword Search in Speech.
SIGMORPHON
(2020)
Mirco Ravanelli
,
Jianyuan Zhong
,
Santiago Pascual
,
Pawel Swietojanski
,
Joao Monteiro
,
Jan Trmal
,
Yoshua Bengio
Multi-task self-supervised learning for Robust Speech Recognition.
CoRR
(2020)
Maarten Van Segbroeck
,
Ahmed Zaid
,
Ksenia Kutsenko
,
Cirenia Huerta
,
Tinh Nguyen
,
Xuewen Luo
,
Björn Hoffmeister
,
Jan Trmal
,
Maurizio Omologo
,
Roland Maas
DiPCo - Dinner Party Corpus.
INTERSPEECH
(2020)
Mirco Ravanelli
,
Jianyuan Zhong
,
Santiago Pascual
,
Pawel Swietojanski
,
Joao Monteiro
,
Jan Trmal
,
Yoshua Bengio
Multi-Task Self-Supervised Learning for Robust Speech Recognition.
ICASSP
(2020)
Saurabhchand Bhati
,
Chunxi Liu
,
Jesús Villalba
,
Jan Trmal
,
Sanjeev Khudanpur
,
Najim Dehak
Bottom-Up Unsupervised Word Discovery via Acoustic Units.
GlobalSIP
(2019)
Matthew Wiesner
,
Oliver Adams
,
David Yarowsky
,
Jan Trmal
,
Sanjeev Khudanpur
Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer.
ASRU
(2019)
Maarten Van Segbroeck
,
Ahmed Zaid
,
Ksenia Kutsenko
,
Cirenia Huerta
,
Tinh Nguyen
,
Xuewen Luo
,
Björn Hoffmeister
,
Jan Trmal
,
Maurizio Omologo
,
Roland Maas
DiPCo - Dinner Party Corpus.
CoRR
(2019)
Oliver Adams
,
Matthew Wiesner
,
Jan Trmal
,
Garrett Nicolai
,
David Yarowsky
Induced Inflection-Set Keyword Search in Speech.
CoRR
(2019)
Ashish Arora
,
Paola García
,
Shinji Watanabe
,
Vimal Manohar
,
Yiwen Shao
,
Sanjeev Khudanpur
,
Chun-Chieh Chang
,
Babak Rekabdar
,
Bagher BabaAli
,
Daniel Povey
,
David Etter
,
Desh Raj
,
Hossein Hadian
,
Jan Trmal
Using ASR Methods for OCR.
ICDAR
(2019)
Matthew Wiesner
,
Chunxi Liu
,
Lucas Ondel
,
Craig Harman
,
Vimal Manohar
,
Jan Trmal
,
Zhongqiang Huang
,
Najim Dehak
,
Sanjeev Khudanpur
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages.
INTERSPEECH
(2018)
Jan Svec
,
Josef V. Psutka
,
Jan Trmal
,
Lubas Smfdl
,
Pavel Ircing
,
Jan Sedmidubský
On the Use of Grapheme Models for Searching in Large Spoken Archives.
ICASSP
(2018)
Chunxi Liu
,
Matthew Wiesner
,
Shinji Watanabe
,
Craig Harman
,
Jan Trmal
,
Najim Dehak
,
Sanjeev Khudanpur
Low-Resource Contextual Topic Identification on Speech.
CoRR
(2018)
Jon Barker
,
Shinji Watanabe
,
Emmanuel Vincent
,
Jan Trmal
The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines.
INTERSPEECH
(2018)
Lubos Smídl
,
Jan Svec
,
Ales Prazák
,
Jan Trmal
Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition.
SPECOM
(2018)
Chunxi Liu
,
Matthew Wiesner
,
Shinji Watanabe
,
Craig Harman
,
Jan Trmal
,
Najim Dehak
,
Sanjeev Khudanpur
Low-Resource Contextual Topic Identification on Speech.
SLT
(2018)
Jon Barker
,
Shinji Watanabe
,
Emmanuel Vincent
,
Jan Trmal
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines.
CoRR
(2018)
Fred Richardson
,
Pedro A. Torres-Carrasquillo
,
Jonas Borgstrom
,
Douglas E. Sturim
,
Youngjune Gwon
,
Jesús Villalba
,
Jan Trmal
,
Nanxin Chen
,
Réda Dehak
,
Najim Dehak
The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System.
Odyssey
(2018)
Hossein Hadian
,
Daniel Povey
,
Hossein Sameti
,
Jan Trmal
,
Sanjeev Khudanpur
Improving LF-MMI Using Unconstrained Supervisions for ASR.
SLT
(2018)
Matthew Wiesner
,
Chunxi Liu
,
Lucas Ondel
,
Craig Harman
,
Vimal Manohar
,
Jan Trmal
,
Zhongqiang Huang
,
Sanjeev Khudanpur
,
Najim Dehak
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection.
CoRR
(2018)
Jan Svec
,
Josef V. Psutka
,
Lubos Smídl
,
Jan Trmal
A Relevance Score Estimation for Spoken Term Detection Based on RNN-Generated Pronunciation Embeddings.
INTERSPEECH
(2017)
Chunxi Liu
,
Jan Trmal
,
Matthew Wiesner
,
Craig Harman
,
Sanjeev Khudanpur
Topic Identification for Speech without ASR.
CoRR
(2017)
Mirko Hannemann
,
Jan Trmal
,
Lucas Ondel
,
Santosh Kesiraju
,
Lukás Burget
Bayesian joint-sequence models for grapheme-to-phoneme conversion.
ICASSP
(2017)
Chunxi Liu
,
Jan Trmal
,
Matthew Wiesner
,
Craig Harman
,
Sanjeev Khudanpur
Topic Identification for Speech Without ASR.
INTERSPEECH
(2017)
Jan Trmal
,
Matthew Wiesner
,
Vijayaditya Peddinti
,
Xiaohui Zhang
,
Pegah Ghahremani
,
Yiming Wang
,
Vimal Manohar
,
Hainan Xu
,
Daniel Povey
,
Sanjeev Khudanpur
The Kaldi OpenKWS System: Improving Low Resource Keyword Search.
INTERSPEECH
(2017)
Jan Trmal
,
Gaurav Kumar
,
Vimal Manohar
,
Sanjeev Khudanpur
,
Matt Post
,
Paul McNamee
Using of heterogeneous corpora for training of an ASR system.
CoRR
(2017)
Eleanor Chodroff
,
Matthew Maciejewski
,
Jan Trmal
,
Sanjeev Khudanpur
,
John Godfrey
New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification.
LREC
(2016)
Gaurav Kumar
,
Graeme W. Blackwood
,
Jan Trmal
,
Daniel Povey
,
Sanjeev Khudanpur
A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation.
EMNLP
(2015)
Chunxi Liu
,
Aren Jansen
,
Guoguo Chen
,
Keith Kintzley
,
Jan Trmal
,
Sanjeev Khudanpur
Low-resource open vocabulary keyword search using point process models.
INTERSPEECH
(2014)
Justin T. Chiu
,
Yun Wang
,
Jan Trmal
,
Daniel Povey
,
Guoguo Chen
,
Alexander I. Rudnicky
Combination of FST and CN search in spoken term detection.
INTERSPEECH
(2014)
Pegah Ghahremani
,
Bagher BabaAli
,
Daniel Povey
,
Korbinian Riedhammer
,
Jan Trmal
,
Sanjeev Khudanpur
A pitch extraction algorithm tuned for automatic speech recognition.
ICASSP
(2014)
Xiaohui Zhang
,
Jan Trmal
,
Daniel Povey
,
Sanjeev Khudanpur
Improving deep neural network acoustic models using generalized maxout networks.
ICASSP
(2014)
Jan Trmal
,
Guoguo Chen
,
Daniel Povey
,
Sanjeev Khudanpur
,
Pegah Ghahremani
,
Xiaohui Zhang
,
Vimal Manohar
,
Chunxi Liu
,
Aren Jansen
,
Dietrich Klakow
,
David Yarowsky
,
Florian Metze
A keyword search system using open source software.
SLT
(2014)
Guoguo Chen
,
Sanjeev Khudanpur
,
Daniel Povey
,
Jan Trmal
,
David Yarowsky
,
Oguz Yilmaz
Quantifying the value of pronunciation lexicons for keyword search in lowresource languages.
ICASSP
(2013)
Guoguo Chen
,
Oguz Yilmaz
,
Jan Trmal
,
Daniel Povey
,
Sanjeev Khudanpur
Using proxies for OOV keywords in the keyword search task.
ASRU
(2013)
Jan Vanek
,
Jan Trmal
,
Josef V. Psutka
,
Josef Psutka
Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors.
IEEE Trans. Speech Audio Process.
20 (6) (2012)
Ales Prazák
,
Zdenek Loose
,
Jan Trmal
,
Josef V. Psutka
,
Josef Psutka
Captioning of Live TV Programs through Speech Recognition and Re-speaking.
TSD
(2012)
Ales Prazák
,
Zdenek Loose
,
Jan Trmal
,
Josef V. Psutka
,
Josef Psutka
Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs.
INTERSPEECH
(2012)
Jan Vanek
,
Jan Trmal
,
Josef V. Psutka
,
Josef Psutka
Full covariance Gaussian mixture models evaluation on GPU.
ISSPIT
(2012)
Jan Vanek
,
Jan Trmal
,
Josef V. Psutka
,
Josef Psutka
Optimization of the Gaussian Mixture Model Evaluation on GPU.
INTERSPEECH
(2011)
Jan Zelinka
,
Jan Trmal
,
Ludek Müller
Low-dimensional space transforms of posteriors in speech recognition.
INTERSPEECH
(2010)
Jan Trmal
,
Ales Prazák
,
Zdenek Loose
,
Josef Psutka
Online TV Captioning of Czech Parliamentary Sessions.
TSD
(2010)
Jan Trmal
,
Jan Zelinka
,
Ludek Müller
Adaptation of a Feedforward Artificial Neural Network Using a Linear Transform.
TSD
(2010)
Jan Trmal
,
Jan Zelinka
,
Ludek Müller
On speaker adaptive training of artificial neural networks.
INTERSPEECH
(2010)
Jan Zelinka
,
Lubos Smídl
,
Jan Trmal
,
Ludek Müller
Posterior Estimates and Transforms for Speech Recognition.
TSD
(2010)
Jindrich Matousek
,
Radek Skarnitzl
,
Pavel Machac
,
Jan Trmal
Identification and automatic detection of parasitic speech sounds.
INTERSPEECH
(2009)
Jan Trmal
,
Marek Hrúz
,
Jan Zelinka
,
Pavel Campr
,
Ludek Müller
Feature space transforms for Czech sign-language recognition.
INTERSPEECH
(2008)
Miroslav Nagy
,
Petr Hanzlícek
,
Jana Zvárová
,
Tatjana Dostálová
,
Michaela Seydlova
,
Radim Hippman
,
Lubos Smídl
,
Jan Trmal
,
Josef Psutka
Voice-controlled Data Entry in Dental Electronic Health Record.
MIE
(2008)
Jan Trmal
,
Jan Zelinka
,
Jan Vanek
,
Ludek Müller
Silence/Speech Detection Method Based on Set of Decision Graphs.
TSD
(2006)
Jan Trmal
,
Jan Vanek
,
Ludek Müller
,
Jan Zelinka
Independent components for acoustic modeling.
INTERSPEECH
(2006)