Policy Iteration Reinforcement Learning-based control using a Grey Wolf Optimizer algorithm.
Iuliu Alexandru ZamfiracheRadu-Emil PrecupRaul-Cristian RomanEmil M. PetriuPublished in: Inf. Sci. (2022)
Keyphrases
- policy iteration
- reinforcement learning
- model free
- stochastic approximation
- optimization algorithm
- dynamic programming
- learning algorithm
- markov decision processes
- objective function
- cost function
- np hard
- optimal control
- sample path
- approximate dynamic programming
- monte carlo
- mathematical model
- fixed point
- convergence rate
- energy function
- particle swarm optimization
- linear programming
- control policy
- least squares
- control system
- optimal solution