Learning Reward Functions from Scale Feedback.
Nils Wilde
Erdem Biyik
Dorsa Sadigh
Stephen L. Smith
Published in:
CoRL (2021)
Keyphrases
learning algorithm
learning process
reinforcement learning
sufficient conditions
complex systems
action selection