Efficiently Solving MDPs with Stochastic Mirror Descent.
Yujia Jin, Aaron Sidford
Published in: ICML (2020)
Keyphrases
- semi markov decision processes
- markov decision processes
- markov decision problems
- state space
- genetic algorithm
- decision theoretic planning
- algebraic decision diagrams
- probability distribution
- optimal policy
- sequential decision making problems
- partially observable
- finite horizon
- initial state
- factored markov decision processes
- transition matrices
- factored mdps
- approximate dynamic programming
- markov decision process
- state transition
- combinatorial optimization
- monte carlo
- evolutionary algorithm
- reinforcement learning