Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization.

Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Wai Kin Victor Chan, Xianyuan Zhan
Published in: CoRR (2023)
Keyphrases
  • learning process
  • reinforcement learning
  • learning algorithm
  • action selection
  • learned knowledge
  • supervised learning
  • learning problems