Illusory Attacks: Information-theoretic detectability matters in adversarial attacks.
Tim Franzmeyer, Stephen Marcus McAleer, João F. Henriques, Jakob Nicolaus Foerster, Philip Torr, Adel Bibi, Christian Schröder de Witt
Published in: ICLR (2024)
Keyphrases
- information theoretic
- mutual information
- information theory
- theoretic framework
- information bottleneck
- entropy measure
- watermarking scheme
- log likelihood
- Kullback-Leibler divergence
- information theoretic measures
- computational learning theory
- Jensen-Shannon divergence
- image segmentation
- machine learning
- minimum description length
- relative entropy
- noise level
- distance measure