Pathological Effects of Variance on Classification-Based Policy Iteration.
Bernardo Ávila PiresCsaba SzepesváriPublished in: AAAI Workshop: Learning for General Competency in Video Games (2015)
Keyphrases
- policy iteration
- markov decision processes
- feature selection
- support vector
- reinforcement learning
- feature space
- support vector machine svm
- optimal policy
- machine learning
- decision trees
- state space
- supervised learning
- machine learning algorithms
- model free
- feature vectors
- text classification
- monte carlo
- decision making