Stabilizing Transformers for Reinforcement Learning.

Emilio Parisotto H. Francis Song Jack W. Rae Razvan Pascanu Çaglar Gülçehre Siddhant M. Jayakumar Max Jaderberg Raphael Lopez Kaufman Aidan Clark Seb Noury Matthew M. Botvinick Nicolas Heess Raia Hadsell

Published in: CoRR (2019)

Keyphrases

reinforcement learning
function approximation
state space
temporal difference
multi agent reinforcement learning
nonlinear systems
markov decision processes
optimal policy
reinforcement learning algorithms
dynamic programming
transfer learning
control problems
robotic control
model free
multi agent
temporal difference learning
partial discharge
stochastic approximation
relational reinforcement learning
database
action space
supervised learning
least squares
learning process
expert systems
data sets