Learning Rewards From Linguistic Feedback.
Theodore R. SumersMark K. HoRobert X. D. HawkinsKarthik NarasimhanThomas L. GriffithsPublished in: AAAI (2021)
Keyphrases
- reinforcement learning
- online learning
- learning algorithm
- learning process
- learning problems
- markov decision processes
- knowledge acquisition
- learning analytics
- neural network
- incremental learning
- learning tasks
- mobile learning
- supervised learning
- information extraction
- dynamic programming
- prior knowledge
- decision trees