Login / Signup
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization.
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Wai Kin Victor Chan
Xianyuan Zhan
Published in:
ICLR (2023)
Keyphrases
</>
reinforcement learning
learning process
learning algorithm
autonomous learning
action selection
action models
learning environment
active learning
supervised learning
collaborative learning
learning problems
partially observable
state action
multiagent reinforcement learning
partially observable domains