Applying Q-Learning to Non-Markovian Environments.
Jurij Chizhov
Arkady Borisov
Published in:
ICAART (2009)
Keyphrases
reinforcement learning
learning algorithm
cooperative
state space
real world
optimal policy
function approximation
reward function
multi agent systems
markov decision processes
action selection
reinforcement learning algorithms
stochastic process