A survey of benchmarks for reinforcement learning algorithms.

Belinda Stapelberg Katherine Mary Malan

Published in: South Afr. Comput. J. (2020)

Keyphrases

markov decision problems
reinforcement learning algorithms
reinforcement learning
state space
reinforcement learning problems
markov decision processes
reward function
function approximators
reward shaping
eligibility traces
model free
reinforcement learning methods
dynamic programming
function approximation
temporal difference
neural network
policy search
stochastic games
partially observable environments
training data
monte carlo
particle filter
multiagent reinforcement learning
markov chain
learning algorithm