What Is Acceptably Safe for Reinforcement Learning?
John BraggIbrahim HabliPublished in: SAFECOMP Workshops (2018)
Keyphrases
- reinforcement learning
- function approximation
- model free
- reinforcement learning algorithms
- multi agent
- state space
- temporal difference
- optimal policy
- temporal difference learning
- markov decision processes
- supervised learning
- multi agent reinforcement learning
- function approximators
- learning process
- learning algorithm
- machine learning
- data sets
- expert systems
- learning problems
- knowledge base
- decision making
- artificial intelligence
- learning capabilities
- action space
- real world
- databases