A survey of benchmarks for reinforcement learning algorithms.
Belinda StapelbergKatherine Mary MalanPublished in: South Afr. Comput. J. (2020)
Keyphrases
- markov decision problems
- reinforcement learning algorithms
- reinforcement learning
- state space
- reinforcement learning problems
- markov decision processes
- reward function
- function approximators
- reward shaping
- eligibility traces
- model free
- reinforcement learning methods
- dynamic programming
- function approximation
- temporal difference
- neural network
- policy search
- stochastic games
- partially observable environments
- training data
- monte carlo
- particle filter
- multiagent reinforcement learning
- markov chain
- learning algorithm