Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization.
Kyle Min
Published in:
CoRR (2022)
Keyphrases
audio visual
multi modal
visual information
visual data
emotion recognition
temporal context
multi stream
multimedia
person authentication
audio visual speech recognition
audio features
pattern recognition
speaker verification
bag of words