Publication: Reinforcement learning for exploratory linear-quadratic two-person zero-sum stochastic differential games.