Publication: Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning.