Login / Signup
How to Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild.
Okan Köpüklü
Maja Taseska
Gerhard Rigoll
Published in:
ICCV (2021)
Keyphrases
</>
audio visual
multi modal
visual information
visual data
speaker verification
temporal context
multimedia
low level
emotion recognition
databases
feature space
vehicle detection
person authentication
audio visual speech recognition