Convergence and Stability of Optimal Regulation via Generalized N-Step Value Gradient Learning.
Ding Wang, Mingming Zhao, Mingming Ha, Junfei Qiao
Published in: IEEE Trans. Neural Networks Learn. Syst. (2024)
Keyphrases
- learning algorithm
- online learning
- learning problems
- learning systems
- machine learning
- dynamic programming
- learning scheme
- prior knowledge
- learning tasks
- background knowledge
- learning phase
- incremental learning
- unsupervised learning
- supervised learning
- active learning
- learning process
- training set
- reinforcement learning
- multiscale