Learning Constraints From Human Stop-Feedback in Reinforcement Learning.
Silvia Poletti, Alberto Testolin, Sebastian Tschiatschek
Published in: AAMAS (2023)
Keyphrases
- reinforcement learning
- learning process
- learning algorithm
- active learning
- tutorial dialogue
- knowledge acquisition
- online learning
- supervised learning
- prior knowledge
- learning systems
- Markov decision processes
- constraint satisfaction
- human experts
- inductive inference
- autonomous learning
- multi-agent reinforcement learning
- state abstraction
- multi-agent