Keyphrases
- reinforcement learning
- profit sharing
- knowledge base
- reinforcement learning algorithms
- state space
- temporal difference
- function approximation
- first order logic
- learning algorithm
- markov decision processes
- multi agent
- optimal policy
- action space
- rational agents
- decision making
- model free
- data sets
- policy search
- robotic control
- optimal control
- active learning
- learning process
- case study
- learning classifier systems
- dynamic programming
- markov decision process
- machine learning
- database
- axiomatic characterization