Approximating Martingale Process for Variance Reduction in Deep Reinforcement Learning with Large State Space.

Published in: CoRR (2022)

Keyphrases

state space
reinforcement learning
variance reduction
optimal policy
reinforcement learning algorithms
learning algorithm
dynamic programming
function approximation
machine learning
monte carlo
temporal difference
training data
gradient estimation