Login / Signup
Policy Tree: Adaptive Representation for Policy Gradient.
Ujjwal Das Gupta
Erik Talvitie
Michael Bowling
Published in:
AAAI (2015)
Keyphrases
</>
policy gradient
actor critic
reinforcement learning
model free reinforcement learning
gradient method
function approximation
average reward
reinforcement learning methods
policy gradient methods
policy search
bayesian networks
optimal policy
optimal control