Publication: Tackling Morpion Solitaire with AlphaZero-like Ranked Reward Reinforcement Learning.