Almost Optimal Model-Free Reinforcement Learningvia Reference-Advantage Decomposition.

Zihan Zhang Yuan Zhou Xiangyang Ji

Published in: NeurIPS (2020)

Keyphrases

model free
reinforcement learning
reinforcement learning algorithms
function approximation
dynamic programming
temporal difference
optimal control
optimal solution
average reward
state space
policy iteration
data mining
learning algorithm
artificial neural networks
impedance control