What Models Know About Their Attackers: Deriving Attacker Information From Latent Representations.
Zhouhang XieJonathan BrophyAdam NoackWencong YouKalyani AsthanaCarter PerkinsSabrina ReisZayd HammoudehDaniel LowdSameer SinghPublished in: BlackboxNLP@EMNLP (2021)