Publication: A Deep Q-Learning Approach for Continuous Review Policies with Uncertain Lead Time Demand Patterns.