Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Care Domain.
Kai Wang
Shresth Verma
Aditya Mate
Sanket Shah
Aparna Taneja
Neha Madhiwalla
Aparna Hegde
Milind Tambe
Published in:
CoRR (2022)
Keyphrases
multi armed bandits
learning process
learning algorithm
reinforcement learning
domain independent
online learning
young children
objective function
lower bound
dynamic programming
least squares
mutual information
game theory
learning tasks