Login / Signup

Reinforcement learning using expectation maximization based guided policy search for stochastic dynamics.

Prakash MallickZhiyiong ChenMohsen Zamani
Published in: Neurocomputing (2022)
Keyphrases