Login / Signup
Intertemporal Price Discrimination: Structure and Computation of Optimal Policies.
Omar Besbes
Ilan Lobel
Published in:
Manag. Sci. (2015)
Keyphrases
</>
optimal policy
markov decision processes
dynamic programming
state space
finite state
multistage
long run
finite horizon
average reward reinforcement learning
reinforcement learning
decision problems
infinite horizon
sufficient conditions
semi markov decision processes
bayesian reinforcement learning