Login / Signup
Contextual Bandits for Evaluating and Improving Inventory Control Policies.
Dean P. Foster
Randy Jia
Dhruv Madeka
Published in:
CoRR (2023)
Keyphrases
</>
control policies
finite horizon
stochastic optimization problems
optimal policy
control policy
reinforcement learning
motion control
supply chain
control strategies
action space
control system
decision making
state space