Login / Signup

Reinforcement learning from human reward: Discounting in episodic tasks.

W. Bradley KnoxPeter Stone
Published in: RO-MAN (2012)
Keyphrases