AIMED-RL: Exploring Adversarial Malware Examples with Reinforcement Learning.
Raphael Labaca Castro, Sebastian Franz, Gabi Dreo Rodosek
Published in: ECML/PKDD (4) (2021)
Keyphrases
- reinforcement learning
- multi agent
- function approximation
- state space
- reinforcement learning algorithms
- model free
- Markov decision processes
- learning algorithm
- temporal difference learning
- learning agents
- optimal policy
- rl algorithms
- control problems
- machine learning
- transfer learning
- learning process
- temporal difference
- supervised learning
- learning problems
- continuous state
- autonomous learning
- direct policy search
- reward function
- learning classifier systems
- optimal control
- function approximators
- reinforcement learning methods
- multiagent reinforcement learning
- continuous state and action spaces