On Policy Evaluation Algorithms in Distributional Reinforcement Learning.
Julian Gerstenberg
Ralph Neininger
Denis Spiegel
Published in:
CoRR (2024)
Keyphrases
policy evaluation
reinforcement learning
model-free
policy iteration
learning algorithm
temporal difference
Markov decision processes
Monte Carlo
least squares
function approximation
worst case
optimal control
TD learning
machine learning
supervised learning
objective function
matrix inversion