Login / Signup
Solving the Rubik's Cube with Approximate Policy Iteration.
Stephen McAleer
Forest Agostinelli
Alexander Shmakov
Pierre Baldi
Published in:
ICLR (Poster) (2019)
Keyphrases
</>
markov decision problems
approximate policy iteration
reinforcement learning