Criteria for selecting the relaxation factor of the value iteration algorithm for undiscounted Markov and semi-Markov decision processes.
Meir Herzberg
Uri Yechiali
Published in:
Oper. Res. Lett. (1991)
Keyphrases
dynamic programming
learning algorithm
selection algorithm
probabilistic model
Markov decision processes
objective function
k-means
machine learning
average reward
iterative algorithms
Markov model
heuristic search
semi-Markov decision processes
energy function
Markov chain
NP-hard
multi-agent