Login / Signup
Gradient-Aware Model-based Policy Search.
Pierluca D'Oro
Alberto Maria Metelli
Andrea Tirinzoni
Matteo Papini
Marcello Restelli
Published in:
CoRR (2019)
Keyphrases
</>
policy search
policy gradient
reinforcement learning
reinforcement learning algorithms
model free
dynamic programming
partially observable markov decision processes
continuous state
continuous action
gradient method
probabilistic model
reward function