Publication: Deterministic limit of temporal difference reinforcement learning for stochastic games.