Regret Bounds for Risk-sensitive Reinforcement Learning with Lipschitz Dynamic Risk Measures.

Hao Liang, Zhi-Quan Luo

Published in: CoRR (2023)

Keyphrases

risk sensitive
risk measures
reinforcement learning
optimal control
model free
Markov decision processes
regret bounds
utility function
state space
function approximation
machine learning
learning algorithm
decision making
control policies