Login / Signup
Model-Based Policy Search Using Monte Carlo Gradient Estimation with Real Systems Application.
Fabio Amadio
Alberto Dalla Libera
Riccardo Antonello
Daniel Nikovski
Ruggero Carli
Diego Romeres
Published in:
CoRR (2021)
Keyphrases
</>
monte carlo
gradient estimation
variance reduction
importance sampling
monte carlo methods
policy search
markov chain
reinforcement learning
learning algorithm
upper bound
particle filter
temporal difference