Keyphrases
- actor-critic
- reinforcement learning
- policy gradient
- temporal difference
- optimal control
- neuro-fuzzy
- approximate dynamic programming
- gradient method
- neural network
- function approximation
- policy iteration
- reinforcement learning algorithms
- Markov decision processes
- decision making
- evaluation function
- cost function
- linear program
- radial basis function
- step size
- dynamic programming
- artificial neural networks