Login / Signup
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning.
Nikita Rudin
David Hoeller
Philipp Reist
Marco Hutter
Published in:
CoRR (2021)
Keyphrases
</>
reinforcement learning
massively parallel
learning algorithm
learning process
learning tasks
supervised learning
optimal policy
parallel computing
special case
probabilistic model
state space
markov decision processes
learning problems