Evading Anti-Malware Engines With Deep Reinforcement Learning.
Zhiyang FangJunfeng WangBoya LiSiqi WuYingjie ZhouHaiying HuangPublished in: IEEE Access (2019)
Keyphrases
- reinforcement learning
- function approximation
- transition model
- reinforcement learning algorithms
- model free
- optimal control
- state space
- multi agent
- optimal policy
- learning algorithm
- control flow
- malware detection
- transfer learning
- temporal difference
- real time
- open source
- learning process
- reverse engineering
- machine learning
- learning agents
- multi agent reinforcement learning