Login / Signup

Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation.

Voot TangkarattSyogo MoriTingting ZhaoJun MorimotoMasashi Sugiyama
Published in: Neural Networks (2014)
Keyphrases