Login / Signup
Reward Shaping with Subgoals for Social Navigation.
Takato Okudo
Seiji Yamada
Published in:
CoRR (2021)
Keyphrases
</>
reward shaping
reinforcement learning
complex domains
reinforcement learning algorithms
state space
markov decision problems
supervised learning
machine learning
sufficient conditions
model free