Login / Signup
A Comparison of Reward Functions in Q-Learning Applied to a Cart Position Problem.
Amartya Mukherjee
Published in:
CoRR (2021)
Keyphrases
</>
reward function
optimal policy
reinforcement learning
state space
learning algorithm
multi agent systems
infinite horizon
reinforcement learning algorithms