Zap Q-Learning With Nonlinear Function Approximation.

Shuhang Chen Adithya M. Devraj Ana Busic Sean P. Meyn

Published in: CoRR (2019)

Keyphrases

function approximation
reinforcement learning
tile coding
temporal difference learning algorithms
state action space
mountain car
learning tasks
radial basis function
model free
temporal difference learning
temporal difference
function approximators
reinforcement learning algorithms
td learning
temporal difference methods
learning algorithm
machine learning
neural network
feature selection