Performance analysis of model-free PID tuning of MIMO systems based on simultaneous perturbation stochastic approximation.
Mohd Ashraf AhmadShun-Ichi AzumaToshiharu SugiePublished in: Expert Syst. Appl. (2014)
Keyphrases
- stochastic approximation
- model free
- mimo systems
- policy iteration
- reinforcement learning
- low complexity
- reinforcement learning algorithms
- function approximation
- control algorithm
- control system
- temporal difference
- temporal difference learning
- rl algorithms
- average reward
- control method
- video transmission
- wireless channels
- pid controller
- markov decision processes
- monte carlo
- fading channels
- neural network
- computational complexity
- search space
- control scheme
- control strategy
- linear programming
- multiple input multiple output