Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension.
Ruosong Wang
Russ R. Salakhutdinov
Lin F. Yang
Published in: NeurIPS (2020)
Keyphrases
reinforcement learning
special case
state space
temporal difference
cost effective
function approximation
approximate dynamic programming
machine learning
closely related
monte carlo
temporal difference learning
lower bound
upper bound
lightweight
action selection