Login / Signup
Ngoc Khanh Nguyen
Publication Activity (10 Years)
Years Active: 2018-2022
Publications (10 Years): 8
Top Topics
Transfer Learning
Background Noise
Error Analysis
Event Detection
Top Venues
CoRR
DCASE
ICASSP
IEEE ACM Trans. Audio Speech Lang. Process.
</>
Publications
</>
Thi Ngoc Tho Nguyen
,
Karn N. Watcharasupat
,
Ngoc Khanh Nguyen
,
Douglas L. Jones
,
Woon-Seng Gan
SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection.
IEEE ACM Trans. Audio Speech Lang. Process.
30 (2022)
Karn N. Watcharasupat
,
Thi Ngoc Tho Nguyen
,
Ngoc Khanh Nguyen
,
Zhen Jian Lee
,
Douglas L. Jones
,
Woon-Seng Gan
Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning.
CoRR
(2021)
Thi Ngoc Tho Nguyen
,
Ngoc Khanh Nguyen
,
Huy Phan
,
Lam Pham
,
Kenneth Ooi
,
Douglas L. Jones
,
Woon-Seng Gan
A General Network Architecture for Sound Event Localization and Detection Using Transfer Learning and Recurrent Neural Network.
ICASSP
(2021)
Thi Ngoc Tho Nguyen
,
Karn N. Watcharasupat
,
Ngoc Khanh Nguyen
,
Douglas L. Jones
,
Woon-Seng Gan
SALSA: Spatial Cue-Augmented Log-Spectrogram Features for Polyphonic Sound Event Localization and Detection.
CoRR
(2021)
Thi Ngoc Tho Nguyen
,
Karn N. Watcharasupat
,
Zhen Jian Lee
,
Ngoc Khanh Nguyen
,
Douglas L. Jones
,
Woon-Seng Gan
What Makes Sound Event Localization and Detection Difficult? Insights from Error Analysis.
CoRR
(2021)
Thi Ngoc Tho Nguyen
,
Karn Watcharasupat
,
Ngoc Khanh Nguyen
,
Douglas L. Jones
,
Woon-Seng Gan
DCASE 2021 Task 3: Spectrotemporally-aligned Features for Polyphonic Sound Event Localization and Detection.
CoRR
(2021)
Thi Ngoc Tho Nguyen
,
Karn N. Watcharasupat
,
Zhen Jian Lee
,
Ngoc Khanh Nguyen
,
Douglas L. Jones
,
Woon-Seng Gan
What Makes Sound Event Localization and Detection Difficult? Insights from Error Analysis.
DCASE
(2021)
Thi Ngoc Tho Nguyen
,
Ngoc Khanh Nguyen
,
Douglas L. Jones
,
Woon-Seng Gan
DCASE 2018 task 2: iterative training, label smoothing, and background noise normalization for audio event tagging.
DCASE
(2018)