Provably Efficient Reinforcement Learning with General Value Function Approximation.
Ruosong Wang
Ruslan Salakhutdinov
Lin F. Yang
Published in:
CoRR (2020)
Keyphrases
reinforcement learning
special case
temporal difference
state space
learning algorithm
closely related
temporal difference learning
information retrieval
lightweight
Markov decision processes
Markov decision process