Login / Signup
Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection.
Xuanjun Chen
Haibin Wu
Helen Meng
Hung-yi Lee
Jyh-Shing Roger Jang
Published in:
SLT (2022)
Keyphrases
</>
audio visual
multi modal
speaker verification
visual information
visual data
temporal context
person authentication
multimedia
emotion recognition
multi stream
audio visual speech recognition
audio features
speech recognition