Minimax Regret for Stochastic Shortest Path.
Alon CohenYonathan EfroniYishay MansourAviv RosenbergPublished in: CoRR (2021)
Keyphrases
- minimax regret
- stochastic shortest path
- markov decision processes
- preference elicitation
- utility function
- reward function
- markov decision problems
- decision problems
- stochastic programming
- misclassification costs
- cost sensitive
- reinforcement learning
- partially observable
- class distribution
- optimal policy
- linear programming
- training and test data
- least squares
- dynamic programming
- machine learning