A Comparison of Reward Functions in Q-Learning Applied to a Cart Position Problem.

Amartya Mukherjee

Published in: CoRR (2021)

Keyphrases

reward function
optimal policy
reinforcement learning
state space
learning algorithm
multi agent systems
infinite horizon
reinforcement learning algorithms