Login / Signup
Uncertainty-Guided End-to-End Audio-Visual Speaker Diarization for Far-Field Recordings.
Chenyu Yang
Mengxi Chen
Yanfeng Wang
Yu Wang
Published in:
ACM Multimedia (2023)
Keyphrases
</>
audio visual
end to end
speaker diarization
speaker verification
multi modal
visual information
speech recognition
visual data
emotion recognition
audio features
multimedia
broadcast news
acoustic features
bayesian information criterion
feature vectors
pattern recognition
speaker identification
computer vision