Login / Signup
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL.
Xiyao Wang
Ruijie Zheng
Yanchao Sun
Ruonan Jia
Wichayaporn Wongkamjan
Huazhe Xu
Furong Huang
Published in:
ICLR (2024)
Keyphrases
</>
model free
reinforcement learning
plan recognition
learning algorithm
data driven
supervised learning
high speed
state space
optimal policy
markov decision processes
planning process