GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models.
Mianchu WangRui YangXi ChenMeng FangPublished in: CoRR (2023)
Keyphrases
- learned models
- reinforcement learning
- learning algorithm
- generative model
- classification models
- partially observable
- action selection
- function approximation
- data mining
- heuristic search
- neural network
- training data
- decision trees
- machine learning
- partial observability
- stochastic domains
- deterministic domains
- optimal control
- optimal policy
- image classification
- learning process