Learning Classical Planning Strategies with Policy Gradient.

Pawel Gomoluch, Dalal Alrajeh, Alessandra Russo

Published in: CoRR (2018)

Keyphrases

policy gradient
learning algorithm
reinforcement learning
domain independent
learning tasks
planning problems
objective function
domain specific
orders of magnitude
heuristic search
state action
actor critic