Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery.

Published in: CoRR (2024)

Keyphrases