Login / Signup
Learning General Policies with Policy Gradient Methods.
Simon Ståhlberg
Blai Bonet
Hector Geffner
Published in:
KR (2023)
Keyphrases
</>
policy gradient methods
learning algorithm
learning process
policy gradient
learning problems
neural network
model checking
decision theoretic
natural actor critic