MERLIN - Malware Evasion with Reinforcement LearnINg.

Tony Quertier Benjamin Marais Stephane Morucci Bertrand Fournel

Published in: CoRR (2022)

Keyphrases

reinforcement learning
function approximation
reverse engineering
countermeasures
reinforcement learning algorithms
malware detection
state space
markov decision processes
learning algorithm
supervised learning
relational reinforcement learning
neural network
policy search
partially observable
temporal difference
optimal policy
dynamic programming
action selection
learning agent
action space
learning agents
learning process
information retrieval
real world