Adversarial Attacks to Reward Machine-based Reinforcement Learning.
Lorenzo Nodari
Published in: CoRR (2023)
Keyphrases
- reinforcement learning
- multi agent
- state space
- function approximation
- countermeasures
- reward function
- model free
- reinforcement learning algorithms
- markov decision processes
- eligibility traces
- flowshop
- optimal policy
- dynamic programming
- learning algorithm
- machine learning
- learning problems
- action selection
- learning capabilities
- batch processing
- information security
- supervised learning
- partially observable
- learning agent
- state action
- markov decision problems
- malicious attacks