AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning.

Michaël Mathieu Sherjil Ozair Srivatsan Srinivasan Çaglar Gülçehre Shangtong Zhang Ray Jiang Tom Le Paine Richard Powell Konrad Zolna Julian Schrittwieser David H. Choi Petko Georgiev Daniel Toyama Aja Huang Roman Ring Igor Babuschkin Timo Ewalds Mahyar Bordbar Sarah Henderson Sergio Gómez Colmenarejo Aäron van den Oord Wojciech Marian Czarnecki Nando de Freitas Oriol Vinyals

Published in: CoRR (2023)

Keyphrases

reinforcement learning
real time
small scale
function approximation
real life
real world
machine learning
markov decision processes
dynamic programming
state space
model free
optimal policy
supervised learning
learning process
expert systems
learning environment
multiscale
temporal difference
reinforcement learning algorithms
control problems
robotic control