Tractable and Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation.
Taehyun Cho
Seungyub Han
Kyungjae Lee
Seokhun Ju
Dohyeong Kim
Jungwoo Lee
Published in:
CoRR (2024)
Keyphrases
reinforcement learning
temporal difference
special case
state space
temporal difference learning
learning algorithm
learning process
NP-complete
Markov decision processes
data mining
mobile robot
constraint satisfaction problems
closely related
computationally expensive
version spaces