Login / Signup
Randomized Exploration for Reinforcement Learning with General Value Function Approximation.
Haque Ishfaq
Qiwen Cui
Viet Nguyen
Alex Ayoub
Zhuoran Yang
Zhaoran Wang
Doina Precup
Lin F. Yang
Published in:
CoRR (2021)
Keyphrases
</>
reinforcement learning
state space
machine learning
active exploration
temporal difference
special case
closely related
state action
approximate dynamic programming
dynamic programming
markov chain
basis functions
function approximation
multi agent
control policy
temporal difference learning
data sets