On the bias of batch Bellman residual minimisation.

Daniel Schneegaß

Published in: Neurocomputing (2009)

Keyphrases

bellman residual
approximation methods
fixed point
policy iteration
sample path
policy evaluation
variance reduction
optimization criterion
hybrid algorithms
markov decision processes
neural network
sufficient conditions
sample size
least squares
model free
optical flow
asymptotic analysis
reinforcement learning