Login / Signup
Learning Classical Planning Strategies with Policy Gradient.
Pawel Gomoluch
Dalal Alrajeh
Alessandra Russo
Published in:
CoRR (2018)
Keyphrases
</>
policy gradient
learning algorithm
reinforcement learning
domain independent
learning tasks
planning problems
objective function
domain specific
orders of magnitude
heuristic search
state action
actor critic