Safe Exploration of State and Action Spaces in Reinforcement Learning.
Javier GarcíaFernando FernándezPublished in: CoRR (2014)
Keyphrases
- state and action spaces
- reinforcement learning
- markov decision processes
- action space
- state space
- markov decision problems
- partially observable markov decision process
- average reward
- action selection
- partially observable markov decision processes
- stochastic processes
- neural network
- reinforcement learning algorithms
- decision theoretic
- function approximation
- policy iteration
- temporal difference
- function approximators
- search strategies
- linear programming