Q* Approximation Schemes for Batch Reinforcement Learning: A Theoretical Comparison.
Tengyang Xie
Nan Jiang
Published in:
UAI (2020)
Keyphrases
approximation schemes
reinforcement learning
approximation algorithms
learning process
function approximation
optimal control
control policy
learning algorithm
multi agent
temporal difference