Login / Signup
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization.
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Wai Kin Victor Chan
Xianyuan Zhan
Published in:
CoRR (2023)
Keyphrases
</>
learning process
reinforcement learning
learning algorithm
action selection
learned knowledge
supervised learning
learning problems