Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards.
Siyuan Li, Rui Wang, Minxue Tang, Chongjie Zhang
Published in: NeurIPS (2019)
Keyphrases
- hierarchical reinforcement learning
- reinforcement learning
- reward function
- Markov decision processes
- state abstraction
- state space
- model free
- reinforcement learning algorithms
- average reward
- data mining
- dynamic programming
- mobile robot
- dynamic systems
- function approximation
- partially observable
- Markov decision process