Model-based Trajectory Stitching for Improved Offline Reinforcement Learning.
Charles A. HepburnGiovanni MontanaPublished in: CoRR (2022)
Keyphrases
- reinforcement learning
- model free
- function approximation
- learning algorithm
- state space
- real world
- learning process
- data sets
- machine learning
- temporal difference
- spatio temporal
- decision trees
- search engine
- dynamic environments
- improved algorithm
- learning classifier systems
- action selection
- multi agent reinforcement learning
- transition model