Applying Q-Learning to Non-Markovian Environments.

Jurij Chizhov, Arkady Borisov

Published in: ICAART (2009)

Keyphrases

reinforcement learning
learning algorithm
cooperative
state space
real world
optimal policy
function approximation
reward function
multi-agent systems
Markov decision processes
action selection
reinforcement learning algorithms
stochastic process