Login / Signup
Exploiting Explainability to Design Adversarial Attacks and Evaluate Attack Resilience in Hate-Speech Detection Models.
Pranath Reddy Kumbam
Sohaib Uddin Syed
Prashanth Thamminedi
Suhas Harish
Ian Perera
Bonnie J. Dorr
Published in:
CoRR (2023)
Keyphrases
</>
attack detection
probabilistic model
countermeasures
design process
speech recognition
detection method
ddos attacks
malicious users
differential power analysis
multi agent
detection algorithm
missing data
detection mechanism
malicious attacks
detecting malicious