Login / Signup
Suwon Shon
Publication Activity (10 Years)
Years Active: 2012-2024
Publications (10 Years): 62
Top Topics
Language Understanding
Convolutional Neural Networks
Speaker Recognition
Negative Matrix Factorization
Top Venues
CoRR
INTERSPEECH
ICASSP
SLT
</>
Publications
</>
Suwon Shon
,
Kwangyoun Kim
,
Prashant Sridhar
,
Yi-Te Hsu
,
Shinji Watanabe
,
Karen Livescu
Generative Context-Aware Fine-Tuning of Self-Supervised Speech Models.
ICASSP
(2024)
Siddhant Arora
,
Ankita Pasad
,
Chung-Ming Chien
,
Jionghao Han
,
Roshan S. Sharma
,
Jee-weon Jung
,
Hira Dhamyal
,
William Chen
,
Suwon Shon
,
Hung-yi Lee
,
Karen Livescu
,
Shinji Watanabe
On the Evaluation of Speech Foundation Models for Spoken Language Understanding.
CoRR
(2024)
Jiyang Tang
,
Kwangyoun Kim
,
Suwon Shon
,
Felix Wu
,
Prashant Sridhar
Improving ASR Contextual Biasing with Guided Attention.
ICASSP
(2024)
Siddhant Arora
,
Ankita Pasad
,
Chung-Ming Chien
,
Jionghao Han
,
Roshan S. Sharma
,
Jee-weon Jung
,
Hira Dhamyal
,
William Chen
,
Suwon Shon
,
Hung-yi Lee
,
Karen Livescu
,
Shinji Watanabe
On the Evaluation of Speech Foundation Models for Spoken Language Understanding.
ACL (Findings)
(2024)
Jiyang Tang
,
Kwangyoun Kim
,
Suwon Shon
,
Felix Wu
,
Prashant Sridhar
,
Shinji Watanabe
Improving ASR Contextual Biasing with Guided Attention.
CoRR
(2024)
Suwon Shon
,
Kwangyoun Kim
,
Yi-Te Hsu
,
Prashant Sridhar
,
Shinji Watanabe
,
Karen Livescu
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding.
CoRR
(2024)
Yifan Peng
,
Kwangyoun Kim
,
Felix Wu
,
Brian Yan
,
Siddhant Arora
,
William Chen
,
Jiyang Tang
,
Suwon Shon
,
Prashant Sridhar
,
Shinji Watanabe
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks.
CoRR
(2023)
Suwon Shon
,
Siddhant Arora
,
Chyi-Jiunn Lin
,
Ankita Pasad
,
Felix Wu
,
Roshan S. Sharma
,
Wei-Lun Wu
,
Hung-yi Lee
,
Karen Livescu
,
Shinji Watanabe
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks.
ACL (1)
(2023)
Yifan Peng
,
Kwangyoun Kim
,
Felix Wu
,
Brian Yan
,
Siddhant Arora
,
William Chen
,
Jiyang Tang
,
Suwon Shon
,
Prashant Sridhar
,
Shinji Watanabe
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks.
INTERSPEECH
(2023)
Suwon Shon
,
Felix Wu
,
Kwangyoun Kim
,
Prashant Sridhar
,
Karen Livescu
,
Shinji Watanabe
Context-Aware Fine-Tuning of Self-Supervised Speech Models.
ICASSP
(2023)
Suwon Shon
,
Kwangyoun Kim
,
Prashant Sridhar
,
Yi-Te Hsu
,
Shinji Watanabe
,
Karen Livescu
Generative Context-aware Fine-tuning of Self-supervised Speech Models.
CoRR
(2023)
Ankita Pasad
,
Felix Wu
,
Suwon Shon
,
Karen Livescu
,
Kyu J. Han
On the Use of External Data for Spoken Named Entity Recognition.
NAACL-HLT
(2022)
Suwon Shon
,
Siddhant Arora
,
Chyi-Jiunn Lin
,
Ankita Pasad
,
Felix Wu
,
Roshan Sharma
,
Wei-Lun Wu
,
Hung-Yi Lee
,
Karen Livescu
,
Shinji Watanabe
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks.
CoRR
(2022)
Suwon Shon
,
Felix Wu
,
Kwangyoun Kim
,
Prashant Sridhar
,
Karen Livescu
,
Shinji Watanabe
Context-aware Fine-tuning of Self-supervised Speech Models.
CoRR
(2022)
Suwon Shon
,
Ankita Pasad
,
Felix Wu
,
Pablo Brusco
,
Yoav Artzi
,
Karen Livescu
,
Kyu J. Han
SLUE: New Benchmark Tasks For Spoken Language Understanding Evaluation on Natural Speech.
ICASSP
(2022)
Suwon Shon
,
Pablo Brusco
,
Jing Pan
,
Kyu J. Han
,
Shinji Watanabe
Leveraging Pre-Trained Language Model for Speech Sentiment Analysis.
Interspeech
(2021)
Ankita Pasad
,
Felix Wu
,
Suwon Shon
,
Karen Livescu
,
Kyu J. Han
On the Use of External Data for Spoken Named Entity Recognition.
CoRR
(2021)
Suwon Shon
,
Ankita Pasad
,
Felix Wu
,
Pablo Brusco
,
Yoav Artzi
,
Karen Livescu
,
Kyu J. Han
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech.
CoRR
(2021)
Suwon Shon
,
Pablo Brusco
,
Jing Pan
,
Kyu J. Han
,
Shinji Watanabe
Leveraging Pre-trained Language Model for Speech Sentiment Analysis.
CoRR
(2021)
Suwon Shon
,
Ahmed Ali
,
Younes Samih
,
Hamdy Mubarak
,
James R. Glass
ADI17: A Fine-Grained Arabic Dialect Identification Dataset.
ICASSP
(2020)
Suwon Shon
,
James R. Glass
Multimodal Association for Speaker Verification.
INTERSPEECH
(2020)
Shammur A. Chowdhury
,
Ahmed Ali
,
Suwon Shon
,
James R. Glass
What Does an End-to-End Dialect Identification Model Learn About Non-Dialectal Information?
INTERSPEECH
(2020)
Suwon Shon
,
Najim Dehak
,
Douglas A. Reynolds
,
James R. Glass
MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation.
INTERSPEECH
(2019)
Suwon Shon
,
Hao Tang
,
James R. Glass
VoiceID Loss: Speech Enhancement for Speaker Verification.
INTERSPEECH
(2019)
Jesús Villalba
,
Nanxin Chen
,
David Snyder
,
Daniel Garcia-Romero
,
Alan McCree
,
Gregory Sell
,
Jonas Borgstrom
,
Fred Richardson
,
Suwon Shon
,
François Grondin
,
Réda Dehak
,
Leibny Paola García-Perera
,
Daniel Povey
,
Pedro A. Torres-Carrasquillo
,
Sanjeev Khudanpur
,
Najim Dehak
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18.
INTERSPEECH
(2019)
Seongkyu Mun
,
Suwon Shon
Domain Mismatch Robust Acoustic Scene Classification Using Channel Information Conversion.
ICASSP
(2019)
Suwon Shon
,
Hao Tang
,
James R. Glass
VoiceID Loss: Speech Enhancement for Speaker Verification.
CoRR
(2019)
Suwon Shon
,
Ahmed Ali
,
James R. Glass
Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain.
ICASSP
(2019)
Suwon Shon
,
Tae-Hyun Oh
,
James R. Glass
Noise-tolerant Audio-visual Online Person Verification Using an Attention-based Neural Network Fusion.
ICASSP
(2019)
Suwon Shon
,
Najim Dehak
,
Douglas A. Reynolds
,
James R. Glass
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation.
CoRR
(2019)
Achintya Kumar Sarkar
,
Zheng-Hua Tan
,
Hao Tang
,
Suwon Shon
,
James R. Glass
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process.
27 (8) (2019)
Achintya Kumar Sarkar
,
Zheng-Hua Tan
,
Hao Tang
,
Suwon Shon
,
James R. Glass
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification.
CoRR
(2019)
Ahmed Ali
,
Suwon Shon
,
Younes Samih
,
Hamdy Mubarak
,
Ahmed Abdelali
,
James R. Glass
,
Steve Renals
,
Khalid Choukri
The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech.
ASRU
(2019)
Suwon Shon
,
Younggun Lee
,
Taesu Kim
Large-Scale Speaker Retrieval on Random Speaker Variability Subspace.
INTERSPEECH
(2019)
Suwon Shon
,
Wei-Ning Hsu
,
James R. Glass
Unsupervised Representation Learning of Speech for Dialect Identification.
SLT
(2018)
Suwon Shon
,
Tae-Hyun Oh
,
James R. Glass
Noise-tolerant Audio-visual Online Person Verification using an Attention-based Neural Network Fusion.
CoRR
(2018)
Suwon Shon
,
Hao Tang
,
James R. Glass
Frame-Level Speaker Embeddings for Text-Independent Speaker Recognition and Analysis of End-to-End Model.
SLT
(2018)
Suwon Shon
,
Ahmed Ali
,
James R. Glass
Convolutional Neural Networks and Language Embeddings for End-to-End Dialect Recognition.
CoRR
(2018)
Suwon Shon
,
Wei-Ning Hsu
,
James R. Glass
Unsupervised Representation Learning of Speech for Dialect Identification.
CoRR
(2018)
Suwon Shon
,
Hao Tang
,
James R. Glass
Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model.
CoRR
(2018)
Marcos Zampieri
,
Shervin Malmasi
,
Preslav Nakov
,
Ahmed Ali
,
Suwon Shon
,
James R. Glass
,
Yves Scherrer
,
Tanja Samardzic
,
Nikola Ljubesic
,
Jörg Tiedemann
,
Chris van der Lee
,
Stefan Grondelaers
,
Nelleke Oostdijk
,
Dirk Speelman
,
Antal van den Bosch
,
Ritesh Kumar
,
Bornini Lahiri
,
Mayank Jain
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign.
VarDial@COLING 2018
(2018)
Maryam Najafian
,
Sameer Khurana
,
Suwon Shon
,
Ahmed Ali
,
James R. Glass
Exploiting Convolutional Neural Networks for Phonotactic Based Dialect Identification.
ICASSP
(2018)
Suwon Shon
,
Ahmed Ali
,
James R. Glass
Convolutional Neural Network and Language Embeddings for End-to-End Dialect Recognition.
Odyssey
(2018)
Seongkyu Mun
,
Suwon Shon
Domain Mismatch Robust Acoustic Scene Classification using Channel Information Conversion.
CoRR
(2018)
Suwon Shon
,
Ahmed Ali
,
James R. Glass
Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain.
CoRR
(2018)
Suwon Shon
,
Younggun Lee
,
Taesu Kim
Large-scale Speaker Retrieval on Random Speaker Variability Subspace.
CoRR
(2018)
Suwon Shon
,
Najim Dehak
,
Douglas A. Reynolds
,
James R. Glass
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation (MCE) Plan, Dataset and Baseline System.
CoRR
(2018)
Seongkyu Mun
,
Minkyu Shin
,
Suwon Shon
,
Wooil Kim
,
David K. Han
,
Hanseok Ko
DNN Transfer Learning Based Non-Linear Feature Extraction for Acoustic Event Classification.
IEICE Trans. Inf. Syst.
(9) (2017)
Seongkyu Mun
,
Minkyu Shin
,
Suwon Shon
,
Wooil Kim
,
David K. Han
,
Hanseok Ko
DNN Transfer Learning based Non-linear Feature Extraction for Acoustic Event Classification.
CoRR
(2017)
Suwon Shon
,
Seongkyu Mun
,
Hanseok Ko
Recursive Whitening Transformation for Speaker Recognition on Language Mismatched Condition.
CoRR
(2017)
Suwon Shon
,
Seongkyu Mun
,
Wooil Kim
,
Hanseok Ko
Autoencoder based Domain Adaptation for Speaker Recognition under Insufficient Channel Information.
CoRR
(2017)
Suwon Shon
,
Hanseok Ko
KU-ISPL Speaker Recognition Systems under Language mismatch condition for NIST 2016 Speaker Recognition Evaluation.
CoRR
(2017)
Seongkyu Mun
,
Suwon Shon
,
Wooil Kim
,
David K. Han
,
Hanseok Ko
Deep Neural Network based learning and transferring mid-level audio features for acoustic scene classification.
ICASSP
(2017)
Suwon Shon
,
Seongkyu Mun
,
Hanseok Ko
Recursive Whitening Transformation for Speaker Recognition on Language Mismatched Condition.
INTERSPEECH
(2017)
Seongkyu Mun
,
Suwon Shon
,
Wooil Kim
,
David K. Han
,
Hanseok Ko
A Novel Discriminative Feature Extraction for Acoustic Scene Classification Using RNN Based Source Separation.
IEICE Trans. Inf. Syst.
(12) (2017)
Suwon Shon
,
Ahmed Ali
,
James R. Glass
MIT-QCRI Arabic Dialect Identification System for the 2017 Multi-Genre Broadcast Challenge.
CoRR
(2017)
Suwon Shon
,
Ahmed Ali
,
James R. Glass
MIT-QCRI Arabic dialect identification system for the 2017 multi-genre broadcast challenge.
ASRU
(2017)
Suwon Shon
,
Seongkyu Mun
,
Wooil Kim
,
Hanseok Ko
Autoencoder Based Domain Adaptation for Speaker Recognition Under Insufficient Channel Information.
INTERSPEECH
(2017)
Suwon Shon
,
Seongkyu Mun
,
John H. L. Hansen
,
Hanseok Ko
KU-ISPL Language Recognition System for NIST 2015 i-Vector Machine Learning Challenge.
CoRR
(2016)
Seong Jae Lee
,
Daehun Kim
,
Suwon Shon
,
Seongkyu Mun
,
Minkyu Shin
,
Youngseng Chen
,
Sejong Hyung
,
Mohammed Harris
,
Hanseok Ko
KU-ISPL TRECVID 2016 Multimedia Event Detection System.
TRECVID
(2016)
Suwon Shon
,
Seongkyu Mun
,
David K. Han
,
Hanseok Ko
Non-negative matrix factorization-based subband decomposition for acoustic source localization.
CoRR
(2016)
Seongkyu Mun
,
Suwon Shon
,
Wooil Kim
,
Hanseok Ko
Deep Neural Network Bottleneck Features for Acoustic Event Recognition.
INTERSPEECH
(2016)
Suwon Shon
,
Seongkyu Mun
,
David K. Han
,
Hanseok Ko
Maximum likelihood Linear Dimension Reduction of heteroscedastic feature for robust Speaker Recognition.
AVSS
(2015)
Seongkyu Mun
,
Suwon Shon
,
Wooil Kim
,
Hanseok Ko
Robust speaker direction estimation with microphone array using NMF for smart TV interaction.
ICCE
(2015)
Sungkyu Moon
,
Suwon Shon
,
Wooil Kim
,
David K. Han
Generalized cross-correlation based noise robust abnormal acoustic event localization utilizing non-negative matrix factorization.
AVSS
(2014)
Suwon Shon
,
David K. Han
,
Hanseok Ko
Abnormal acoustic event localization based on selective frequency bin in high noise environment for audio surveillance.
AVSS
(2013)
Suwon Shon
,
David K. Han
,
Jounghoon Beh
,
Hanseok Ko
Full Azimuth Multiple Sound Source Localization with 3-Channel Microphone Array.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci.
(4) (2012)
Suwon Shon
,
Eric Kim
,
Jongsung Yoon
,
Hanseok Ko
Sudden noise source localization system for intelligent automobile application with acoustic sensors.
ICCE
(2012)