Login / Signup
Approximating Martingale Process for Variance Reduction in Deep Reinforcement Learning with Large State Space.
Charlie Ruan
Published in:
CoRR (2022)
Keyphrases
</>
state space
reinforcement learning
variance reduction
optimal policy
reinforcement learning algorithms
learning algorithm
dynamic programming
function approximation
machine learning
monte carlo
temporal difference
training data
gradient estimation