Sparse randomized policies for Markov decision processes based on Tsallis divergence regularization.

Pierre Leleux, Bertrand Lebichot, Guillaume Guex, Marco Saerens
Published in: Knowl. Based Syst. (2024)
Keyphrases