Login / Signup

Reinformer: Max-Return Sequence Modeling for Offline RL.

Zifeng ZhuangDengyun PengJinxin LiuZiqi ZhangDonglin Wang
Published in: CoRR (2024)
Keyphrases
  • reinforcement learning
  • input data
  • data sets
  • real world
  • knowledge base
  • hidden markov models
  • probabilistic model
  • state space
  • long sequences