Reinformer: Max-Return Sequence Modeling for Offline RL.
Zifeng Zhuang
Dengyun Peng
Jinxin Liu
Ziqi Zhang
Donglin Wang
Published in:
CoRR (2024)
Keyphrases
reinforcement learning
input data
data sets
real world
knowledge base
hidden markov models
probabilistic model
state space
long sequences