Keyphrases
- temporal difference
- td learning
- reinforcement learning
- function approximation
- evaluation function
- monte carlo
- bit rate
- model free
- reinforcement learning algorithms
- image segmentation
- temporal difference learning
- step size
- policy evaluation
- policy iteration
- action selection
- optical flow
- supervised learning
- temporal difference methods
- actor critic
- neural network
- learning process