Learning General Policies with Policy Gradient Methods

Simon Ståhlberg, Blai Bonet, Hector Geffner

Published in: KR (2023)

Keyphrases

policy gradient methods
learning algorithm
learning process
policy gradient
learning problems
neural network
model checking
decision theoretic
natural actor critic