Offline Reinforcement Learning as One Big Sequence Modeling Problem.
Michael Janner
Qiyang Li
Sergey Levine
Published in:
NeurIPS (2021)
Keyphrases
reinforcement learning
information systems
real time
function approximation
hidden state
database
website
multi agent
artificial neural networks
learning process
dynamic programming
least squares
supervised learning
markov chain
modeling language
model free