A comprehensive survey on safe reinforcement learning.

Javier García, Fernando Fernández

Published in: J. Mach. Learn. Res. (2015)

Keyphrases

reinforcement learning
function approximation
state space
temporal difference
optimal policy
reinforcement learning algorithms
robotic control
learning process
temporal difference learning
stochastic approximation
optimal solution
information systems
robot control
learning algorithm
model free
perceptual aliasing
reinforcement learning methods
control problems
planning problems
optimal control
database
supervised learning
least squares
clustering algorithm
website
neural network