Login / Signup
The Effect of Modeling Human Rationality Level on Learning Rewards from Multiple Feedback Types.
Gaurav R. Ghosal
Matthew Zurek
Daniel S. Brown
Anca D. Dragan
Published in:
AAAI (2023)
Keyphrases
</>
bandit problems
language acquisition
learning algorithm
reinforcement learning
human learning
unsupervised learning
virtual environment
multiple tasks
multiple types
combining multiple
inductive inference
learning problems
higher level
knowledge acquisition
prior knowledge
learning process
decision trees