Publication: Traffic Light Control Using Deep Policy-Gradient and Value-Function Based Reinforcement Learning.