Reward Shaping with Subgoals for Social Navigation.

Takato Okudo, Seiji Yamada

Published in: CoRR (2021)

Keyphrases

reward shaping
reinforcement learning
complex domains
reinforcement learning algorithms
state space
Markov decision problems
supervised learning
machine learning
sufficient conditions
model-free