Risk-Sensitive RL with Optimized Certainty Equivalents via Reduction to Standard RL.
Kaiwen WangDawen LiangNathan KallusWen SunPublished in: CoRR (2024)
Keyphrases
- risk sensitive
- reinforcement learning
- model free
- markov decision processes
- optimal control
- control policies
- reinforcement learning algorithms
- action space
- state space
- optimal policy
- function approximation
- markov decision chains
- dynamic programming
- control strategies
- utility function
- finite state
- control policy
- average reward
- evolutionary algorithm
- multi agent
- expected utility
- decision making
- objective function