Publication: Reinforcement Learning Considering Worst Case and Equality within Episodes.