Publication: MEPE: A Minimalist Ensemble Policy Evaluation Operator for Deep Reinforcement Learning.