GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models.
Mianchu Wang, Rui Yang, Xi Chen, Hao Sun, Meng Fang, Giovanni Montana
Published in: Trans. Mach. Learn. Res. (2024)
Keyphrases
- learned models
- reinforcement learning
- learning algorithm
- action selection
- generative model
- planning problems
- deterministic domains
- partially observable
- learning process
- domain independent
- function approximation
- optimal control
- multi agent
- machine learning
- classification accuracy
- probabilistic model
- training data
- complex domains
- real time