Learning Rewards from Linguistic Feedback.
Theodore R. SumersMark K. HoRobert X. D. HawkinsKarthik NarasimhanThomas L. GriffithsPublished in: CoRR (2020)
Keyphrases
- reinforcement learning
- learning process
- learning algorithm
- active learning
- real time
- e learning
- online learning
- learning systems
- knowledge acquisition
- erroneous examples
- assessment tool
- learning mechanism
- learning tasks
- markov decision processes
- background knowledge
- multi agent
- search engine
- machine learning
- neural network