Enhancing the Insertion of NOP Instructions to Obfuscate Malware via Deep Reinforcement Learning.
Daniel GibertMatt FredriksonCarles MateuJordi PlanesQuan LePublished in: CoRR (2021)
Keyphrases
- reinforcement learning
- function approximation
- markov decision processes
- state space
- malware detection
- reinforcement learning algorithms
- reverse engineering
- robotic control
- optimal policy
- temporal difference
- learning algorithm
- dynamic analysis
- database
- multi agent
- dynamic programming
- static analysis
- malicious code
- policy search
- multi agent reinforcement learning
- neural network
- temporal difference learning
- action space
- machine learning
- model free
- information systems
- transfer learning
- least squares
- learning process