Keyphrases
- model free
- reinforcement learning
- function approximation
- clustering algorithm
- reinforcement learning algorithms
- temporal difference
- sample size
- data points
- policy iteration
- monte carlo
- policy evaluation
- singular value decomposition
- genetic algorithm
- average reward
- impedance control
- feature extraction
- low rank approximation