Alex Park

Publication Activity (10 Years)

Years Active: 2001-2023
Publications (10 Years): 11

Top Topics

Speech Recognizer

Spectral Subtraction

Keyword Spotting

Top Venues

Publications

Beltrán Labrador, Pai Zhu, Guanlong Zhao, Angelo Scorza Scarpati, Quan Wang, Alicia Lozano-Diez, Alex Park, Ignacio López-Moreno
Personalizing Keyword Spotting with Speaker Information. CoRR (2023)
Pai Zhu, Hyun Jin Park, Alex Park, Angelo Scorza Scarpati, Ignacio Lopez-Moreno
Locale Encoding For Scalable Multilingual Keyword Spotting Models. CoRR (2023)
Pai Zhu, Hyun Jin Park, Alex Park, Angelo Scorza Scarpati, Ignacio López-Moreno
Locale Encoding for Scalable Multilingual Keyword Spotting Models. ICASSP (2023)
Andrew Hard, Kurt Partridge, Neng Chen, Sean Augenstein, Aishanee Shah, Hyun Jin Park, Alex Park, Sara Ng, Jessica Nguyen, Ignacio Lopez-Moreno, Rajiv Mathews, Françoise Beaufays
Production federated keyword spotting via distillation, filtering, and joint federated-centralized training. INTERSPEECH (2022)
Sankaran Panchapagesan, Arun Narayanan, Turaj Zakizadeh Shabestary, Shuai Shao, Nathan Howard, Alex Park, James Walker, Alexander Gruenstein
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy. INTERSPEECH (2022)
Andrew Hard, Kurt Partridge, Neng Chen, Sean Augenstein, Aishanee Shah, Hyun Jin Park, Alex Park, Sara Ng, Jessica Nguyen, Ignacio Lopez-Moreno, Rajiv Mathews, Françoise Beaufays
Production federated keyword spotting via distillation, filtering, and joint federated-centralized training. CoRR (2022)
Sankaran Panchapagesan, Arun Narayanan, Turaj Zakizadeh Shabestary, Shuai Shao, Nathan Howard, Alex Park, James Walker, Alexander Gruenstein
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy. CoRR (2022)
Nathan Howard, Alex Park, Turaj Zakizadeh Shabestary, Alexander Gruenstein, Rohit Prabhavalkar
A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer And Large Scale Synthetic Data. CoRR (2021)
Tom O'Malley, Arun Narayanan, Quan Wang, Alex Park, James Walker, Nathan Howard
A Conformer-Based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation. ASRU (2021)
Nathan Howard, Alex Park, Turaj Zakizadeh Shabestary, Alexander Gruenstein, Rohit Prabhavalkar
A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer and Large Scale Synthetic Data. ICASSP (2021)
Tom O'Malley, Arun Narayanan, Quan Wang, Alex Park, James Walker, Nathan Howard
A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation. CoRR (2021)
Igor Malioutov, Alex Park, Regina Barzilay, James R. Glass
Making Sense of Sound: Unsupervised Topic Segmentation over Acoustic Input. ACL (2007)
Ram H. Woo, Alex Park, Timothy J. Hazen
The MIT Mobile Device Speaker Verification Corpus: Data Collection and Preliminary Experiments. Odyssey (2006)
Alex Park, James R. Glass
Unsupervised Word Acquisition from Speech using Pattern Discovery. ICASSP (1) (2006)
Alex Park, James R. Glass
A Novel DTW-Based Distance Measure for speaker Segmentation. SLT (2006)
James R. Glass, Timothy J. Hazen, D. Scott Cyphers, Ken Schutte, Alex Park
The MIT Spoken Lecture Processing Project. HLT/EMNLP (2005)
Alex Park, Timothy J. Hazen, James R. Glass
Automatic Processing of Audio Lectures for Information Retrieval: Vocabulary Selection and Language Modeling. ICASSP (1) (2005)
Alex Park, Timothy J. Hazen
A comparison of normalization and training approaches for ASR-dependent speaker identification. INTERSPEECH (2004)
Timothy J. Hazen, Eugene Weinstein, Alex Park
Towards robust person recognition on handheld devices using face and speaker identification technologies. ICMI (2003)
Timothy J. Hazen, Douglas A. Jones, Alex Park, Linda C. Kukolich, Douglas A. Reynolds
Integration of speaker recognition into conversational spoken dialogue systems. INTERSPEECH (2003)
Alex Park, Timothy J. Hazen
ASR dependent techniques for speaker identification. INTERSPEECH (2002)
Timothy J. Hazen, I. Lee Hetherington, Alex Park
FST-based recognition techniques for multi-lingual and multi-domain spontaneous speech. INTERSPEECH (2001)