Publication: Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning.