Login / Signup
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning.
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
Published in:
CoRR (2021)
Keyphrases
</>
reinforcement learning
optimization methods
learning algorithm
cost function
least squares
optimal control
model free
temporal difference
control problems
actor critic