Learning Reward Functions from Scale Feedback.

Nils Wilde Erdem Biyik Dorsa Sadigh Stephen L. Smith

Published in: CoRL (2021)

Keyphrases

learning algorithm
learning process
reinforcement learning
sufficient conditions
complex systems
action selection