Login / Signup
Randomized Exploration in Reinforcement Learning with General Value Function Approximation.
Haque Ishfaq
Qiwen Cui
Viet Nguyen
Alex Ayoub
Zhuoran Yang
Zhaoran Wang
Doina Precup
Lin Yang
Published in:
ICML (2021)
Keyphrases
</>
reinforcement learning
state space
temporal difference
special case
closely related
temporal difference learning
real time
machine learning
learning algorithm
information systems
markov chain
basis functions
active exploration