Keyphrases
- markov decision processes
- reinforcement learning
- factored mdps
- state space
- optimal policy
- decision theoretic planning
- finite horizon
- planning under uncertainty
- average cost
- markov decision process
- finite state
- model based reinforcement learning
- markov decision problems
- policy iteration
- real time dynamic programming
- decision processes
- dynamic programming
- dec pomdps
- continuous state and action spaces
- reinforcement learning algorithms
- decision diagrams
- probabilistic planning
- infinite horizon
- average reward
- data sets
- partially observable