Login / Signup
Krylov-Bellman boosting: Super-linear policy evaluation in general state spaces.
Eric Xia
Martin J. Wainwright
Published in:
CoRR (2022)
Keyphrases
</>
policy evaluation
state space
variance reduction
least squares
markov decision processes
reinforcement learning
markov chain
reinforcement learning algorithms
learning algorithm
feature selection
monte carlo
temporal difference
policy iteration