Publication: An immediate-return reinforcement learning for the atypical Markov decision processes.