
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization.

Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Wai Kin Victor Chan, Xianyuan Zhan
Published in: CoRR (2023)
Keyphrases
  • learning process
  • reinforcement learning
  • learning algorithm
  • action selection
  • learned knowledge
  • supervised learning
  • learning problems