Proportional Aggregation of Preferences for Sequential Decision Making.
Nikhil ChandakShashwat GoelDominik PetersPublished in: AAAI (2024)
Keyphrases
- sequential decision making
- decision problems
- reinforcement learning
- influence diagrams
- decision making
- interactive dynamic influence diagrams
- temporal difference
- special case
- expected utility
- preference aggregation
- bayesian networks
- computational complexity
- multi objective
- optimal policy
- function approximation
- multiple objectives