​
Login / Signup
Danwei Cai
ORCID
Publication Activity (10 Years)
Years Active: 2016-2024
Publications (10 Years): 36
Top Topics
Speaker Verification
Negative Matrix Factorization
Reflective Learning
Gaussian Mixture
Top Venues
CoRR
INTERSPEECH
ICASSP
IEEE ACM Trans. Audio Speech Lang. Process.
</>
Publications
</>
Danwei Cai
,
Ming Li
Leveraging ASR Pretrained Conformers for Speaker Verification Through Transfer Learning and Knowledge Distillation.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Danwei Cai
,
Zexin Cai
,
Ming Li
Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning.
CoRR
(2024)
Weiqing Wang
,
Danwei Cai
,
Ming Cheng
,
Ming Li
Joint Inference of Speaker Diarization and ASR with Multi-Stage Information Sharing.
ICASSP
(2024)
Danwei Cai
,
Zexin Cai
,
Ming Li
Identifying Source Speakers for Voice Conversion Based Spoofing Attacks on Speaker Verification Systems.
ICASSP
(2023)
Danwei Cai
,
Weiqing Wang
,
Ming Li
,
Rui Xia
,
Chuanzeng Huang
Pretraining Conformer with ASR for Speaker Verification.
ICASSP
(2023)
Xiaoyi Qin
,
Danwei Cai
,
Ming Li
Robust Multi-Channel Far-Field Speaker Verification Under Different In-Domain Data Availability Scenarios.
IEEE ACM Trans. Audio Speech Lang. Process.
31 (2023)
Danwei Cai
,
Weiqing Wang
,
Ming Li
Incorporating Visual Information in Audio Based Self-Supervised Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process.
30 (2022)
Weiqing Wang
,
Qingjian Lin
,
Danwei Cai
,
Ming Li
Similarity Measurement of Segment-Level Speaker Embeddings in Speaker Diarization.
IEEE ACM Trans. Audio Speech Lang. Process.
30 (2022)
Danwei Cai
,
Zexin Cai
,
Ming Li
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems.
CoRR
(2022)
Weiqing Wang
,
Qingjian Lin
,
Danwei Cai
,
Lin Yang
,
Ming Li
The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge.
CoRR
(2021)
Danwei Cai
,
Weiqing Wang
,
Ming Li
An Iterative Framework for Self-Supervised Deep Speaker Representation Learning.
ICASSP
(2021)
Weiqing Wang
,
Danwei Cai
,
Jin Wang
,
Qingjian Lin
,
Xuyang Wang
,
Mi Hong
,
Ming Li
The DKU-Duke-Lenovo System Description for the Fearless Steps Challenge Phase III.
Interspeech
(2021)
Weiqing Wang
,
Danwei Cai
,
Qingjian Lin
,
Lin Yang
,
Junjie Wang
,
Jin Wang
,
Ming Li
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge.
CoRR
(2021)
Danwei Cai
,
Ming Li
Embedding Aggregation for Far-Field Speaker Verification with Distributed Microphone Arrays.
SLT
(2021)
Danwei Cai
,
Ming Li
The DKU-DukeECE System for the Self-Supervision Speaker Verification Task of the 2021 VoxCeleb Speaker Recognition Challenge.
CoRR
(2021)
Danwei Cai
,
Weiqing Wang
,
Ming Li
An iterative framework for self-supervised deep speaker representation learning.
CoRR
(2020)
Danwei Cai
,
Weicheng Cai
,
Ming Li
Within-Sample Variability-Invariant Loss for Robust Speaker Recognition Under Noisy Environments.
ICASSP
(2020)
Danwei Cai
,
Weicheng Cai
,
Ming Li
Within-sample variability-invariant loss for robust speaker recognition under noisy environments.
CoRR
(2020)
Danwei Cai
,
Xiaoyi Qin
,
Ming Li
Multi-Channel Training for End-to-End Speaker Recognition Under Reverberant and Noisy Environment.
INTERSPEECH
(2019)
Danwei Cai
,
Xiaoyi Qin
,
Weicheng Cai
,
Ming Li
The DKU System for the Speaker Recognition Task of the 2019 VOiCES from a Distance Challenge.
INTERSPEECH
(2019)
Weicheng Cai
,
Haiwei Wu
,
Danwei Cai
,
Ming Li
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion.
CoRR
(2019)
Weicheng Cai
,
Haiwei Wu
,
Danwei Cai
,
Ming Li
The DKU Replay Detection System for the ASVspoof 2019 Challenge: On Data Augmentation, Feature Representation, Classification, and Fusion.
INTERSPEECH
(2019)
Danwei Cai
,
Weicheng Cai
,
Ming Li
The DKU-SMIIP System for NIST 2018 Speaker Recognition Evaluation.
INTERSPEECH
(2019)
Weicheng Cai
,
Danwei Cai
,
Shen Huang
,
Ming Li
Utterance-level end-to-end language identification using attention-based CNN-BLSTM.
CoRR
(2019)
Weicheng Cai
,
Danwei Cai
,
Shen Huang
,
Ming Li
Utterance-level End-to-end Language Identification Using Attention-based CNN-BLSTM.
ICASSP
(2019)
Ming Li
,
Weicheng Cai
,
Danwei Cai
Survey Talk: End-to-End Deep Neural Network Based Speaker and Language Recognition.
INTERSPEECH
(2019)
Xiaoyi Qin
,
Danwei Cai
,
Ming Li
Far-Field End-to-End Text-Dependent Speaker Verification Based on Mixed Training Data with Transfer Learning and Enrollment Data Augmentation.
INTERSPEECH
(2019)
Danwei Cai
,
Zexin Cai
,
Ming Li
Deep Speaker Embeddings with Convolutional Neural Network on Supervector for Text-Independent Speaker Recognition.
APSIPA
(2018)
Jinkun Chen
,
Weicheng Cai
,
Danwei Cai
,
Zexin Cai
,
Haibin Zhong
,
Ming Li
End-to-end Language Identification using NetFV and NetVLAD.
ISCSLP
(2018)
Zexin Cai
,
Xiaoyi Qin
,
Danwei Cai
,
Ming Li
,
Xinzhong Liu
,
Haibin Zhong
The DKU-JNU-EMA Electromagnetic Articulography Database on Mandarin and Chinese Dialects with Tandem Feature based Acoustic-to-Articulatory Inversion.
ISCSLP
(2018)
Kong-Yik Chee
,
Zhe Jin
,
Danwei Cai
,
Ming Li
,
Wun-She Yap
,
Yen-Lung Lai
,
Bok-Min Goi
Cancellable speech template via random binary orthogonal matrices projection hashing.
Pattern Recognit.
76 (2018)
Jinkun Chen
,
Weicheng Cai
,
Danwei Cai
,
Zexin Cai
,
Haibin Zhong
,
Ming Li
End-to-end Language Identification using NetFV and NetVLAD.
CoRR
(2018)
Weicheng Cai
,
Danwei Cai
,
Wenbo Liu
,
Gang Li
,
Ming Li
Countermeasures for Automatic Speaker Verification Replay Spoofing Attack : On Data Augmentation, Feature Representation, Classification and Fusion.
INTERSPEECH
(2017)
Danwei Cai
,
Zhidong Ni
,
Wenbo Liu
,
Weicheng Cai
,
Gang Li
,
Ming Li
End-to-End Deep Learning Framework for Speech Paralinguistics Detection Based on Perception Aware Spectrum.
INTERSPEECH
(2017)
Ming Li
,
Luting Wang
,
Zhicheng Xu
,
Danwei Cai
Mandarin electrolaryngeal voice conversion with combination of Gaussian mixture model and non-negative matrix factorization.
APSIPA
(2017)
Danwei Cai
,
Weicheng Cai
,
Zhidong Ni
,
Ming Li
Locality sensitive discriminant analysis for speaker verification.
APSIPA
(2016)