Research and Application of Reinforcement Learning Based on Constraint MDP in Coal Mine.

Xiao-hu Zhao, Ke-ke Zhao, Qing-qing Wang, Fang-qing Ma

Published in: CSIE (4) (2009)

Keyphrases

reinforcement learning
Markov decision processes
optimal policy
state space
learning process
function approximation
least squares
Markov decision process
genetic algorithm
learning algorithm