Epoch-incremental reinforcement learning algorithms.
Roman ZajdelPublished in: Int. J. Appl. Math. Comput. Sci. (2013)
Keyphrases
- reinforcement learning algorithms
- reinforcement learning
- markov decision processes
- state space
- model free
- eligibility traces
- reinforcement learning problems
- function approximation
- learning algorithm
- temporal difference
- policy search
- reward function
- stochastic games
- reinforcement learning methods
- dynamic programming
- partially observable environments
- optimal policy
- higher order
- hidden markov models
- policy iteration
- policy gradient
- prior knowledge
- neural network
- tabula rasa