Login / Signup
Active exploration by searching for experiments that falsify the computed control policy.
Raphael Fonteneau
Susan A. Murphy
Louis Wehenkel
Damien Ernst
Published in:
ADPRL (2011)
Keyphrases
</>
control policy
active exploration
reinforcement learning
long run
control policies
approximate dynamic programming
problem based learning
active learning
average cost
wireless sensor networks
data collection
small sample