AIMED-RL: Exploring Adversarial Malware Examples with Reinforcement Learning.

Raphael Labaca Castro Sebastian Franz Gabi Dreo Rodosek

Published in: ECML/PKDD (4) (2021)

Keyphrases

reinforcement learning
multi agent
function approximation
state space
reinforcement learning algorithms
model free
markov decision processes
learning algorithm
temporal difference learning
learning agents
optimal policy
rl algorithms
control problems
machine learning
transfer learning
learning process
temporal difference
supervised learning
learning problems
continuous state
autonomous learning
direct policy search
reward function
learning classifier systems
optimal control
function approximators
reinforcement learning methods
multiagent reinforcement learning
continuous state and action spaces