Adaptive importance sampling for value function approximation in off-policy reinforcement learning.

Published in: Neural Networks (2009)

Keyphrases