PAC Optimal Planning for Invasive Species Management: Improved Exploration for Reinforcement Learning from Simulator-Defined MDPs.
Thomas G. DietterichMajid Alkaee TaleghanMark CrowleyPublished in: AAAI (2013)
Keyphrases
- reinforcement learning
- optimal planning
- markov decision processes
- model based reinforcement learning
- state space
- optimal policy
- heuristic search
- learning algorithm
- function approximation
- state space search
- planning problems
- markov decision process
- machine learning
- domain independent
- policy search
- average reward
- action selection
- dynamic programming
- reward function
- reinforcement learning algorithms
- partially observable
- action space
- upper bound
- state and action spaces
- model free
- planning domains
- finite state
- search algorithm
- markov decision problems
- orders of magnitude