Login / Signup
Puming Zhan
Publication Activity (10 Years)
Years Active: 1996-2024
Publications (10 Years): 17
Top Topics
Language Model
Automatic Speech Recognition
Noisy Environments
Density Ratio
Top Venues
CoRR
INTERSPEECH
ASRU
ICASSP
</>
Publications
</>
Dario Albesano
,
Nicola Ferri
,
Felix Weninger
,
Puming Zhan
Improving Speed/Accuracy Tradeoff for Online Streaming ASR via Real-Valued and Trainable Strides.
ICASSP
(2024)
Felix Weninger
,
Marco Gaudesi
,
Md. Akmal Haidar
,
Nicola Ferri
,
Jesús Andrés-Ferrer
,
Puming Zhan
Conformer with dual-mode chunked attention for joint online and offline ASR.
INTERSPEECH
(2022)
Dario Albesano
,
Jesús Andrés-Ferrer
,
Nicola Ferri
,
Puming Zhan
On the Prediction Network Architecture in RNN-T for ASR.
CoRR
(2022)
Dario Albesano
,
Jesús Andrés-Ferrer
,
Nicola Ferri
,
Puming Zhan
On the Prediction Network Architecture in RNN-T for ASR.
INTERSPEECH
(2022)
Jesús Andrés-Ferrer
,
Dario Albesano
,
Puming Zhan
,
Paul Vozila
Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems.
CoRR
(2022)
Jesús Andrés-Ferrer
,
Dario Albesano
,
Puming Zhan
,
Paul Vozila
Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems.
Interspeech
(2021)
Marco Gaudesi
,
Felix Weninger
,
Dushyant Sharma
,
Puming Zhan
ChannelAugment: Improving generalization of multi-channel ASR by training with input channel randomization.
CoRR
(2021)
Marco Gaudesi
,
Felix Weninger
,
Dushyant Sharma
,
Puming Zhan
ChannelAugment: Improving Generalization of Multi-Channel ASR by Training with Input Channel Randomization.
ASRU
(2021)
Felix Weninger
,
Marco Gaudesi
,
Ralf Leibold
,
Roberto Gemello
,
Puming Zhan
Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition.
CoRR
(2021)
Felix Weninger
,
Marco Gaudesi
,
Ralf Leibold
,
Roberto Gemello
,
Puming Zhan
Dual-Encoder Architecture with Encoder Selection for Joint Close-Talk and Far-Talk Speech Recognition.
ASRU
(2021)
Felix Weninger
,
Franco Mana
,
Roberto Gemello
,
Jesús Andrés-Ferrer
,
Puming Zhan
Semi-Supervised Learning with Data Augmentation for End-to-End ASR.
INTERSPEECH
(2020)
Felix Weninger
,
Franco Mana
,
Roberto Gemello
,
Jesús Andrés-Ferrer
,
Puming Zhan
Semi-Supervised Learning with Data Augmentation for End-to-End ASR.
CoRR
(2020)
Felix Weninger
,
Jesús Andrés-Ferrer
,
Xinwei Li
,
Puming Zhan
Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR.
INTERSPEECH
(2019)
Franco Mana
,
Felix Weninger
,
Roberto Gemello
,
Puming Zhan
Online Batch Normalization Adaptation for Automatic Speech Recognition.
ASRU
(2019)
Felix Weninger
,
Yang Sun
,
Junho Park
,
Daniel Willett
,
Puming Zhan
Deep Learning Based Mandarin Accent Identification for Accent Robust ASR.
INTERSPEECH
(2019)
Felix Weninger
,
Jesús Andrés-Ferrer
,
Xinwei Li
,
Puming Zhan
Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR.
CoRR
(2019)
Matthew Gibson
,
Gary Cook
,
Puming Zhan
Semi-supervised training strategies for deep neural networks.
ASRU
(2017)
Takashi Fukuda
,
Ryuki Tachibana
,
Upendra V. Chaudhari
,
Bhuvana Ramabhadran
,
Puming Zhan
Constructing ensembles of dissimilar acoustic models using hidden attributes of training data.
ICASSP
(2012)
Ryuki Tachibana
,
Takashi Fukuda
,
Upendra V. Chaudhari
,
Bhuvana Ramabhadran
,
Puming Zhan
Frame-level AnyBoost for LVCSR with the MMI Criterion.
ASRU
(2011)
Steven Wegmann
,
Puming Zhan
,
Larry Gillick
Progress in Broadcast News transcription at Dragon Systems.
ICASSP
(1999)
Steven Wegmann
,
Puming Zhan
,
Ira Carp
,
Michael Newman
,
Jon Yamron
,
Larry Gillick
Dragon systems' 1998 broadcast news transcription system.
EUROSPEECH
(1999)
Puming Zhan
,
Martin Westphal
Speaker normalization based on frequency warping.
ICASSP
(1997)
Alon Lavie
,
Alex Waibel
,
Lori S. Levin
,
Michael Finke
,
Donna Gates
,
Marsal Gavaldà
,
Torsten Zeppenfeld
,
Puming Zhan
Janus-III: speech-to-speech translation in multiple languages.
ICASSP
(1997)
Puming Zhan
,
Martin Westphal
,
Michael Finke
,
Alex Waibel
Speaker normalization and speaker adaptation - a combination for conversational speech recognition.
EUROSPEECH
(1997)
Alex Waibel
,
Michael Finke
,
Donna Gates
,
Marsal Gavaldà
,
Thomas Kemp
,
Alon Lavie
,
Lori S. Levin
,
Martin Maier
,
Laura Mayfield
,
Arthur E. McNair
,
Ivica Rogina
,
Kaori Shima
,
Tilo Sloboda
,
Monika Woszczyna
,
Torsten Zeppenfeld
,
Puming Zhan
JANUS-II-translation of spontaneous conversational speech.
ICASSP
(1996)
Puming Zhan
,
Klaus Ries
,
Marsal Gavaldà
,
Donna Gates
,
Alon Lavie
,
Alex Waibel
JANUS-II: towards spontaneous Spanish speech recognition.
ICSLP
(1996)
Alon Lavie
,
Alex Waibel
,
Lori S. Levin
,
Donna Gates
,
Marsal Gavaldà
,
Torsten Zeppenfeld
,
Puming Zhan
,
Oren Glickman
Translation of conversational speech with JANUS-II.
ICSLP
(1996)
Donna Gates
,
Alon Lavie
,
Lori S. Levin
,
Alex Waibel
,
Marsal Gavaldà
,
Laura Mayfield
,
Monika Woszczyna
,
Puming Zhan
End-to-End Evaluation in JANUS: A Speech-to-speech Translation System.
ECAI Workshop on Dialogue Processing in Spoken Language Systems
(1996)