Solving the Rubik's Cube with Approximate Policy Iteration.

Stephen McAleer Forest Agostinelli Alexander Shmakov Pierre Baldi

Published in: ICLR (Poster) (2019)

Keyphrases

markov decision problems
approximate policy iteration
reinforcement learning