Correction to: Model-free inverse reinforcement learning with multi-intention, unlabeled, and overlapping demonstrations.
Ariyan Bighashdel, Pavol Jancura, Gijs Dubbelman
Published in: Mach. Learn. (2023)
Keyphrases
- model-free
- inverse reinforcement learning
- temporal difference
- reinforcement learning
- reinforcement learning algorithms
- reward function
- policy iteration
- function approximation
- preference elicitation
- class labels
- unsupervised learning
- training set
- supervised learning
- semi-supervised learning
- Markov decision processes
- average reward
- machine learning
- labeled data
- active learning
- learning algorithm