Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection.
Xuanjun Chen
Haibin Wu
Helen Meng
Hung-yi Lee
Jyh-Shing Roger Jang
Published in: CoRR (2022)
Keyphrases
audio visual
multi modal
visual information
speaker verification
visual data
person authentication
emotion recognition
multimedia
multi stream
temporal context
audio visual speech recognition
audio features
hidden markov models
visual features
image data
domain knowledge
pattern recognition