Login / Signup
Alex Park
Publication Activity (10 Years)
Years Active: 2001-2023
Publications (10 Years): 11
Top Topics
Speech Recognizer
Synthetic Data
Spectral Subtraction
Keyword Spotting
Top Venues
CoRR
ICASSP
INTERSPEECH
ASRU
</>
Publications
</>
Beltrán Labrador
,
Pai Zhu
,
Guanlong Zhao
,
Angelo Scorza Scarpati
,
Quan Wang
,
Alicia Lozano-Diez
,
Alex Park
,
Ignacio López-Moreno
Personalizing Keyword Spotting with Speaker Information.
CoRR
(2023)
Pai Zhu
,
Hyun Jin Park
,
Alex Park
,
Angelo Scorza Scarpati
,
Ignacio Lopez-Moreno
Locale Encoding For Scalable Multilingual Keyword Spotting Models.
CoRR
(2023)
Pai Zhu
,
Hyun Jin Park
,
Alex Park
,
Angelo Scorza Scarpati
,
Ignacio López-Moreno
Locale Encoding for Scalable Multilingual Keyword Spotting Models.
ICASSP
(2023)
Andrew Hard
,
Kurt Partridge
,
Neng Chen
,
Sean Augenstein
,
Aishanee Shah
,
Hyun Jin Park
,
Alex Park
,
Sara Ng
,
Jessica Nguyen
,
Ignacio Lopez-Moreno
,
Rajiv Mathews
,
Françoise Beaufays
Production federated keyword spotting via distillation, filtering, and joint federated-centralized training.
INTERSPEECH
(2022)
Sankaran Panchapagesan
,
Arun Narayanan
,
Turaj Zakizadeh Shabestary
,
Shuai Shao
,
Nathan Howard
,
Alex Park
,
James Walker
,
Alexander Gruenstein
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy.
INTERSPEECH
(2022)
Andrew Hard
,
Kurt Partridge
,
Neng Chen
,
Sean Augenstein
,
Aishanee Shah
,
Hyun Jin Park
,
Alex Park
,
Sara Ng
,
Jessica Nguyen
,
Ignacio Lopez-Moreno
,
Rajiv Mathews
,
Françoise Beaufays
Production federated keyword spotting via distillation, filtering, and joint federated-centralized training.
CoRR
(2022)
Sankaran Panchapagesan
,
Arun Narayanan
,
Turaj Zakizadeh Shabestary
,
Shuai Shao
,
Nathan Howard
,
Alex Park
,
James Walker
,
Alexander Gruenstein
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy.
CoRR
(2022)
Nathan Howard
,
Alex Park
,
Turaj Zakizadeh Shabestary
,
Alexander Gruenstein
,
Rohit Prabhavalkar
A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer And Large Scale Synthetic Data.
CoRR
(2021)
Tom O'Malley
,
Arun Narayanan
,
Quan Wang
,
Alex Park
,
James Walker
,
Nathan Howard
A Conformer-Based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation.
ASRU
(2021)
Nathan Howard
,
Alex Park
,
Turaj Zakizadeh Shabestary
,
Alexander Gruenstein
,
Rohit Prabhavalkar
A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer and Large Scale Synthetic Data.
ICASSP
(2021)
Tom O'Malley
,
Arun Narayanan
,
Quan Wang
,
Alex Park
,
James Walker
,
Nathan Howard
A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation.
CoRR
(2021)
Igor Malioutov
,
Alex Park
,
Regina Barzilay
,
James R. Glass
Making Sense of Sound: Unsupervised Topic Segmentation over Acoustic Input.
ACL
(2007)
Ram H. Woo
,
Alex Park
,
Timothy J. Hazen
The MIT Mobile Device Speaker Verification Corpus: Data Collection and Preliminary Experiments.
Odyssey
(2006)
Alex Park
,
James R. Glass
Unsupervised Word Acquisition from Speech using Pattern Discovery.
ICASSP (1)
(2006)
Alex Park
,
James R. Glass
A Novel DTW-Based Distance Measure for speaker Segmentation.
SLT
(2006)
James R. Glass
,
Timothy J. Hazen
,
D. Scott Cyphers
,
Ken Schutte
,
Alex Park
The MIT Spoken Lecture Processing Project.
HLT/EMNLP
(2005)
Alex Park
,
Timothy J. Hazen
,
James R. Glass
Automatic Processing of Audio Lectures for Information Retrieval: Vocabulary Selection and Language Modeling.
ICASSP (1)
(2005)
Alex Park
,
Timothy J. Hazen
A comparison of normalization and training approaches for ASR-dependent speaker identification.
INTERSPEECH
(2004)
Timothy J. Hazen
,
Eugene Weinstein
,
Alex Park
Towards robust person recognition on handheld devices using face and speaker identification technologies.
ICMI
(2003)
Timothy J. Hazen
,
Douglas A. Jones
,
Alex Park
,
Linda C. Kukolich
,
Douglas A. Reynolds
Integration of speaker recognition into conversational spoken dialogue systems.
INTERSPEECH
(2003)
Alex Park
,
Timothy J. Hazen
ASR dependent techniques for speaker identification.
INTERSPEECH
(2002)
Timothy J. Hazen
,
I. Lee Hetherington
,
Alex Park
FST-based recognition techniques for multi-lingual and multi-domain spontaneous speech.
INTERSPEECH
(2001)