Regret Bounds for Risk-sensitive Reinforcement Learning with Lipschitz Dynamic Risk Measures.
Hao Liang
Zhi-Quan Luo
Published in:
CoRR (2023)
Keyphrases
risk sensitive
risk measures
reinforcement learning
optimal control
model free
Markov decision processes
regret bounds
utility function
state space
function approximation
machine learning
learning algorithm
decision making
control policies