Login / Signup
Potential-based Reward Shaping in Sokoban.
Zhao Yang
Mike Preuss
Aske Plaat
Published in:
CoRR (2021)
Keyphrases
</>
reward shaping
complex domains
reinforcement learning
multi agent
mobile robot
complex environments