Login / Signup
Almost Optimal Model-Free Reinforcement Learningvia Reference-Advantage Decomposition.
Zihan Zhang
Yuan Zhou
Xiangyang Ji
Published in:
NeurIPS (2020)
Keyphrases
</>
model free
reinforcement learning
reinforcement learning algorithms
function approximation
dynamic programming
temporal difference
optimal control
optimal solution
average reward
state space
policy iteration
data mining
learning algorithm
artificial neural networks
impedance control