The Value Function Polytope in Reinforcement Learning.

Robert Dadashi Adrien Ali Taïga Nicolas Le Roux Dale Schuurmans Marc G. Bellemare

Published in: CoRR (2019)

Keyphrases

reinforcement learning
case study
control policy
function approximators
genetic algorithm
learning algorithm
lattice points
temporal difference
radial basis function
robotic control
temporal difference learning
semidefinite
reinforcement learning algorithms
function approximation
data sets
random walk
state space
mobile robot
machine learning