Keyphrases
- average reward
- markov decision processes
- optimal policy
- state space
- reinforcement learning
- support vector
- orders of magnitude
- dynamic programming
- domain specific
- infinite horizon
- learning experience
- linear combination
- expectation maximization
- pairwise
- multi agent
- sufficient conditions
- hidden markov models
- cost function
- prior knowledge
- moving objects
- belief revision