Login / Signup
Research and Application of Reinforcement Learning Based on Constraint MDP in Coal Mine.
Xiao-hu Zhao
Ke-ke Zhao
Qing-qing Wang
Fang-qing Ma
Published in:
CSIE (4) (2009)
Keyphrases
</>
reinforcement learning
markov decision processes
optimal policy
state space
learning process
function approximation
least squares
markov decision process
genetic algorithm
learning algorithm