Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals.
Yizheng ZhangAndre RosendoPublished in: ROBIO (2019)
Keyphrases
- reward shaping
- reinforcement learning
- reinforcement learning algorithms
- complex domains
- function approximation
- strategic and tactical
- state space
- markov decision processes
- reward function
- optimal policy
- model free
- temporal difference
- learning process
- markov decision problems
- multi agent
- transfer learning
- learning algorithm
- optimal control
- dynamic programming