Publication: Model-Based Actor-Critic for Multi-Objective Reinforcement Learning with Dynamic Utility Functions.