The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning.
Volodymyr Tkachuk
Sriram Ganapathi Subramanian
Matthew E. Taylor
Published in:
CoRR (2021)
Keyphrases
model-free reinforcement learning
policy gradient
machine learning
lower bound
online learning
real-valued