Login / Signup
Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery.
Zohre Karimi
Shing-Hei Ho
Bao Thach
Alan Kuntz
Daniel S. Brown
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
prior knowledge
learning process
learning algorithm
hidden markov models
learning scenarios
data sets
machine learning
active learning
dynamic programming
knowledge acquisition
learning problems
learning scheme
partially observable environments