Publication: Q-Learning for Feedback Nash Strategy of Finite-Horizon Nonzero-Sum Difference Games.