What Is Acceptably Safe for Reinforcement Learning?

John Bragg, Ibrahim Habli

Published in: SAFECOMP Workshops (2018)

Keyphrases

reinforcement learning
function approximation
model-free
reinforcement learning algorithms
multi-agent
state space
temporal difference
optimal policy
temporal difference learning
Markov decision processes
supervised learning
multi-agent reinforcement learning
function approximators
learning process
learning algorithm
machine learning
data sets
expert systems
learning problems
knowledge base
decision making
artificial intelligence
learning capabilities
action space
real world
databases