Learning Purposeful Behaviour in the Absence of Rewards.
Marlos C. MachadoMichael H. BowlingPublished in: CoRR (2016)
Keyphrases
- reinforcement learning
- learning algorithm
- learning systems
- online learning
- learning process
- artificial intelligence
- information retrieval
- probabilistic model
- active learning
- learning mechanism
- database
- background knowledge
- learning activities
- prior knowledge
- image sequences
- case study
- decision making
- search engine
- real time