Login / Signup

Convergence and stability analysis of value iteration Q-learning under non-discounted cost for discrete-time optimal control.

Shijie SongMingming ZhaoDawei GongMinglei Zhu
Published in: Neurocomputing (2024)
Keyphrases