A Lyapunov-based version of the value iteration algorithm formulated as a discrete-time switched affine system.
Raffaele Iervolino, Massimo Tipaldi, Ali Forootani
Published in: Int. J. Control (2023)
Keyphrases
- dynamic programming
- optimal solution
- cost function
- k-means
- optimization algorithm
- segmentation algorithm
- computational cost
- preprocessing
- recognition algorithm
- objective function
- learning algorithm
- search space
- computational complexity
- Markov decision processes
- worst case
- linear programming
- mathematical model
- semidefinite programming
- expectation maximization
- probabilistic model
- search algorithm
- reinforcement learning