Login / Signup
Guanlong Zhao
ORCID
Publication Activity (10 Years)
Years Active: 2017-2024
Publications (10 Years): 28
Top Topics
Change Detection
Speaker Diarization
Speech Corpus
Acoustic Models
Top Venues
CoRR
ICASSP
INTERSPEECH
IEEE ACM Trans. Audio Speech Lang. Process.
</>
Publications
</>
Quan Wang
,
Yiling Huang
,
Guanlong Zhao
,
Evan Clark
,
Wei Xia
,
Hank Liao
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models.
CoRR
(2024)
Guanlong Zhao
,
Yongqiang Wang
,
Jason Pelecanos
,
Yu Zhang
,
Hank Liao
,
Yiling Huang
,
Han Lu
,
Quan Wang
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models.
ICASSP
(2024)
Beltrán Labrador
,
Pai Zhu
,
Guanlong Zhao
,
Angelo Scorza Scarpati
,
Quan Wang
,
Alicia Lozano-Diez
,
Alex Park
,
Ignacio López-Moreno
Personalizing Keyword Spotting with Speaker Information.
CoRR
(2023)
Yiling Huang
,
Weiran Wang
,
Guanlong Zhao
,
Hank Liao
,
Wei Xia
,
Quan Wang
Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network.
CoRR
(2023)
Beltrán Labrador
,
Guanlong Zhao
,
Ignacio López-Moreno
,
Angelo Scorza Scarpati
,
Liam Fowl
,
Quan Wang
Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting.
ICASSP
(2023)
Guanlong Zhao
,
Quan Wang
,
Han Lu
,
Yiling Huang
,
Ignacio López-Moreno
Augmenting Transformer-Transducer Based Speaker Change Detection with Token-Level Training Loss.
ICASSP
(2023)
Guanlong Zhao
,
Yongqiang Wang
,
Jason Pelecanos
,
Yu Zhang
,
Hank Liao
,
Yiling Huang
,
Han Lu
,
Quan Wang
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models.
CoRR
(2023)
Beltrán Labrador
,
Guanlong Zhao
,
Ignacio López-Moreno
,
Angelo Scorza Scarpati
,
Liam Fowl
,
Quan Wang
Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting.
CoRR
(2022)
Shaojin Ding
,
Guanlong Zhao
,
Ricardo Gutierrez-Osuna
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning.
Comput. Speech Lang.
72 (2022)
Quan Wang
,
Yiling Huang
,
Han Lu
,
Guanlong Zhao
,
Ignacio Lopez-Moreno
Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering.
CoRR
(2022)
Guanlong Zhao
,
Quan Wang
,
Han Lu
,
Yiling Huang
,
Ignacio Lopez Moreno
Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss.
CoRR
(2022)
Adam Hair
,
Guanlong Zhao
,
Beena Ahmed
,
Kirrie J. Ballard
,
Ricardo Gutierrez-Osuna
Assessing Posterior-Based Mispronunciation Detection on Field-Collected Recordings from Child Speech Therapy Sessions.
Interspeech
(2021)
Alif Silpachai
,
Ivana Rehman
,
Taylor Anne Barriuso
,
John Levis
,
Evgeny Chukharev-Hudilainen
,
Guanlong Zhao
,
Ricardo Gutierrez-Osuna
Effects of Voice Type and Task on L2 Learners' Awareness of Pronunciation Errors.
Interspeech
(2021)
Guanlong Zhao
,
Shaojin Ding
,
Ricardo Gutierrez-Osuna
Converting Foreign Accent Speech Without a Reference.
IEEE ACM Trans. Audio Speech Lang. Process.
29 (2021)
Shaojin Ding
,
Guanlong Zhao
,
Christopher Liberatore
,
Ricardo Gutierrez-Osuna
Learning Structured Sparse Representations for Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process.
28 (2020)
Arindrima Datta
,
Guanlong Zhao
,
Bhuvana Ramabhadran
,
Eugene Weinstein
LSTM Acoustic Models Learn to Align and Pronounce with Graphemes.
CoRR
(2020)
Shaojin Ding
,
Guanlong Zhao
,
Ricardo Gutierrez-Osuna
Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition.
INTERSPEECH
(2020)
Anurag Das
,
Guanlong Zhao
,
John Levis
,
Evgeny Chukharev-Hudilainen
,
Ricardo Gutierrez-Osuna
Understanding the Effect of Voice Quality and Accent on Talker Similarity.
INTERSPEECH
(2020)
Guanlong Zhao
,
Ricardo Gutierrez-Osuna
Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion.
IEEE ACM Trans. Audio Speech Lang. Process.
27 (10) (2019)
Shaojin Ding
,
Christopher Liberatore
,
Sinem Sonsaat
,
Ivana Lucic
,
Alif Silpachai
,
Guanlong Zhao
,
Evgeny Chukharev-Hudilainen
,
John Levis
,
Ricardo Gutierrez-Osuna
Golden speaker builder - An interactive tool for pronunciation training.
Speech Commun.
115 (2019)
Guanlong Zhao
,
Shaojin Ding
,
Ricardo Gutierrez-Osuna
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams.
INTERSPEECH
(2019)
Shaojin Ding
,
Guanlong Zhao
,
Christopher Liberatore
,
Ricardo Gutierrez-Osuna
Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function.
INTERSPEECH
(2018)
Yu Liu
,
Guanlong Zhao
,
Boyuan Gong
,
Yang Li
,
Ritu Raj
,
Niraj Goel
,
Satya Kesav
,
Sandeep Gottimukkala
,
Zhangyang Wang
,
Wenqi Ren
,
Dacheng Tao
Improved Techniques for Learning to Dehaze and Beyond: A Collective Study.
CoRR
(2018)
Guanlong Zhao
,
Sinem Sonsaat
,
John Levis
,
Evgeny Chukharev-Hudilainen
,
Ricardo Gutierrez-Osuna
Accent Conversion Using Phonetic Posteriorgrams.
ICASSP
(2018)
Yu Liu
,
Guanlong Zhao
PAD-Net: A Perception-Aided Single Image Dehazing Network.
CoRR
(2018)
Guanlong Zhao
,
Sinem Sonsaat
,
Alif Silpachai
,
Ivana Lucic
,
Evgeny Chukharev-Hudilainen
,
John Levis
,
Ricardo Gutierrez-Osuna
L2-ARCTIC: A Non-native English Speech Corpus.
INTERSPEECH
(2018)
Christopher Liberatore
,
Guanlong Zhao
,
Ricardo Gutierrez-Osuna
Voice Conversion Through Residual Warping in a Sparse, Anchor-Based Representation of Speech.
ICASSP
(2018)
Guanlong Zhao
,
Ricardo Gutierrez-Osuna
Exemplar selection methods in voice conversion.
ICASSP
(2017)