Mine-to-client planning with Markov Decision Process.
João Marcelo Leal Gomes LeiteEdilson F. ArrudaLaura BahienseLino G. MarujoPublished in: ECC (2020)
Keyphrases
- markov decision process
- state space
- probabilistic planning
- partial observability
- initial state
- markov decision processes
- optimal policy
- reinforcement learning
- planning problems
- finite horizon
- heuristic search
- infinite horizon
- transition matrices
- partially observable
- policy iteration
- average cost
- machine learning
- transition probabilities
- decision theoretic
- classical planning
- situation calculus
- markov chain
- prior knowledge
- stationary policies