Tactical Reward Shaping: Bypassing Reinforcement Learning with Strategy-Based Goals.

Yizheng Zhang, Andre Rosendo

Published in: ROBIO (2019)

Keyphrases

reward shaping
reinforcement learning
reinforcement learning algorithms
complex domains
function approximation
strategic and tactical
state space
Markov decision processes
reward function
optimal policy
model free
temporal difference
learning process
Markov decision problems
multi agent
transfer learning
learning algorithm
optimal control
dynamic programming