A reinforcement learning approach to rare trajectory sampling.
Dominic C. Rose, Jamie F. Mair, Juan P. Garrahan
Published in: CoRR (2020)
Keyphrases
- reinforcement learning
- sample size
- model free
- learning algorithm
- state space
- reinforcement learning algorithms
- monte carlo
- random sampling
- function approximation
- reinforcement learning methods
- sampling strategy
- optimal policy
- machine learning
- sampling strategies
- parameter space
- markov decision processes
- temporal difference
- video data
- sampling algorithm
- trajectory data
- dynamic programming
- learning process
- transition model
- policy search
- robotic control