Publication: A Reinforcement Learning Approach to Optimize Discount and Reputation Tradeoffs in E-commerce Systems.