Login / Signup
Learning a Reward Function for User-Preferred Appliance Scheduling.
Nikolina Covic
Jochen Cremer
Hrvoje Pandzic
Published in:
CoRR (2023)
Keyphrases
</>
inverse reinforcement learning
reinforcement learning
active learning
machine learning
mobile robot
information extraction
supervised learning
maximum likelihood