Behaviour Suite for Reinforcement Learning.

Ian Osband Yotam Doron Matteo Hessel John Aslanides Eren Sezener Andre Saraiva Katrina McKinney Tor Lattimore Csaba Szepesvári Satinder Singh Benjamin Van Roy Richard S. Sutton David Silver Hado van Hasselt

Published in: CoRR (2019)

Keyphrases

reinforcement learning
reward function
function approximation
learning algorithm
state space
user behaviour
model free
neural network
transition model
optimal policy
direct policy search
continuous state
control problems
reinforcement learning algorithms
learning classifier systems
optimal control
transfer learning
real time
learning process
machine learning
evolutionary algorithm
temporal difference
case study
knowledge base
social networks
information retrieval
databases