Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery.
Patrick SauxPublished in: CoRR (2024)
Keyphrases
- sequential decision making
- reinforcement learning
- decision problems
- stochastic systems
- expected utility
- interactive dynamic influence diagrams
- influence diagrams
- computer assisted
- intraoperative
- neural network
- monte carlo
- special case
- confidence intervals
- multi armed bandit
- sensitivity analysis
- utility function
- decision theoretic
- temporal difference
- decision making
- machine learning