Adaptive importance sampling for value function approximation in off-policy reinforcement learning.

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiyama, Jan Peters
Published in: Neural Networks (2009)
Keyphrases