COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL.
Xiyao Wang
Ruijie Zheng
Yanchao Sun
Ruonan Jia
Wichayaporn Wongkamjan
Huazhe Xu
Furong Huang
Published in:
CoRR (2023)
Keyphrases
reinforcement learning
model free
high speed
plan recognition
multi agent
database systems
multi agent systems
domain independent
function approximation
optimal control
reinforcement learning algorithms
plan execution