Reinforcement Learning by Value Gradients

Michael Fairbank

Published in: CoRR (2008)

Keyphrases

reinforcement learning
state space
function approximation
temporal difference
model free
markov decision processes
reinforcement learning algorithms
learning algorithm
multi agent
search algorithm
action selection
data sets
multi agent reinforcement learning
evolutionary learning
control problems
optimal policy
decision making
real time
dynamic programming
multi agent systems
knowledge base
learning capabilities
neural network
robot control
gradient information
temporal difference learning
databases
transition model
robotic control