Reinforcement learning with value advice.

Mayank Daswani Peter Sunehag Marcus Hutter

Published in: ACML (2014)

Keyphrases