Publication: Reinforcement learning from simultaneous human and MDP reward.