Optimal Transport-Assisted Risk-Sensitive Q-Learning.
Zahra ShahrooeiAli BaheriPublished in: CoRR (2024)
Keyphrases
- risk sensitive
- optimal control
- risk neutral
- optimality criterion
- model free
- reinforcement learning
- dynamic programming
- cooperative
- control policies
- multi agent
- state space
- average cost
- optimal policy
- utility function
- control strategy
- function approximation
- learning algorithm
- markov decision processes
- infinite horizon
- policy iteration