Model-free safe policy learning via hard action barrier functions.
Agustin Castellano
Juan Andrés Bazerque
Enrique Mallada
Published in:
CISS (2021)
Keyphrases
model free
reinforcement learning
learning tasks
action selection
learning algorithm
learning process
rl algorithms
neural network
feature selection
policy iteration
action space
state action
reinforcement learning methods
policy evaluation