Learning rewards from exploratory demonstrations using probabilistic temporal ranking.
Michael BurkeKatie LuDaniel AngelovArturas StraizysCraig InnesKartic SubrSubramanian RamamoorthyPublished in: Auton. Robots (2023)
Keyphrases
- reinforcement learning
- learning algorithm
- learning process
- learning tasks
- uncertain data
- learning systems
- temporal reasoning
- prior knowledge
- online learning
- preference learning
- cognitive load
- inductive inference
- spatial and temporal
- generative model
- neural network
- probabilistic model
- mobile devices
- decision trees
- information retrieval
- machine learning