Convergence and numerical stability of action-dependent heuristic dynamic programming algorithms based on RLS learning for online DLQR optimal control.
Guilherme Bonfim De SousaPatrícia Helena Moraes RêgoPublished in: Int. J. Comput. Sci. Eng. (2019)