Simplified Risk-aware Decision Making with Belief-dependent Rewards in Partially Observable Domains.
Andrey Zhitnikov, Vadim Indelman
Published in: Artif. Intell. (2022)
Keyphrases
- decision making
- partially observable domains
- reinforcement learning
- expected utility
- decision makers
- partially observable
- Markov decision processes
- inverse reinforcement learning
- decision theory
- reward function
- supply chain
- belief functions
- belief state
- action selection
- partially observable Markov decision processes
- state space
- utility function
- infinite horizon