An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method.

Published in: Mach. Learn. (2018)

Keyphrases