Login / Signup
Analysis and Optimisation of Bellman Residual Errors with Neural Function Approximation.
Martin Gottwald
Sven Gronauer
Hao Shen
Klaus Diepold
Published in:
CoRR (2021)
Keyphrases
</>
function approximation
reinforcement learning
temporal difference learning algorithms
linear program
temporal difference learning
data mining
genetic algorithm
learning process
text mining
learning tasks
model free
temporal difference