Sub-policy Adaptation for Hierarchical Reinforcement Learning.
Alexander C. Li, Carlos Florensa, Ignasi Clavera, Pieter Abbeel
Published in: CoRR (2019)
Keyphrases
- hierarchical reinforcement learning
- reward function
- markov decision process
- average reward
- reinforcement learning
- optimal policy
- model free
- markov decision processes
- state abstraction
- policy iteration
- action selection
- neural network
- data mining
- machine learning
- monte carlo
- reinforcement learning algorithms
- partially observable
- markov decision problems