Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection.
Peter HendersonBen ChuggBrandon R. AndersonKristen M. AltenburgerAlex TurkJohn GuytonJacob GoldinDaniel E. HoPublished in: AAAI (2023)
Keyphrases
- sequential decision making
- reinforcement learning
- interactive dynamic influence diagrams
- decision problems
- influence diagrams
- evolutionary algorithm
- function approximation
- web services
- long run
- objective function
- markov decision processes
- optimal policy
- state space
- dynamic programming
- special case
- temporal difference
- expected utility
- data mining