Risk-Averse Learning by Temporal Difference Methods with Markov Risk Measures.

Umit KoseAndrzej Ruszczynski
Published in: J. Mach. Learn. Res. (2021)