The FlySpeech Audio-Visual Speaker Diarization System for MISP Challenge 2022.
Li Zhang
Huan Zhao
Yue Li
Bowen Pang
Yannan Wang
Hongji Wang
Wei Rao
Qing Wang
Lei Xie
Published in:
CoRR (2023)
Keyphrases
audio visual
speaker diarization
speaker verification
multi modal
visual information
multi stream
emotion recognition
visual data
speech recognition
multimedia
audio features
low level
bayesian information criterion