Model-free safe policy learning via hard action barrier functions.

Agustin Castellano, Juan Andrés Bazerque, Enrique Mallada

Published in: CISS (2021)

Keyphrases

model free
reinforcement learning
learning tasks
action selection
learning algorithm
learning process
rl algorithms
neural network
feature selection
policy iteration
action space
state action
reinforcement learning methods
policy evaluation