Analysis and Optimisation of Bellman Residual Errors with Neural Function Approximation.

Martin Gottwald Sven Gronauer Hao Shen Klaus Diepold

Published in: CoRR (2021)

Keyphrases

function approximation
reinforcement learning
temporal difference learning algorithms
linear program
temporal difference learning
data mining
genetic algorithm
learning process
text mining
learning tasks
model free
temporal difference