Login / Signup
Real-time Architecture for Audio-Visual Active Speaker Detection.
Min Huang
Wen Wang
Zheyuan Lin
Fiseha B. Tesema
Shanshan Ji
Jason Gu
Minhong Wang
Wei Song
Te Li
Shiqiang Zhu
Published in:
ROBIO (2022)
Keyphrases
</>
audio visual
real time
multi modal
visual information
speaker verification
visual data
multimedia
multi stream
temporal context
person authentication
emotion recognition
audio visual speech recognition