Publication: Short Quantum Circuits in Reinforcement Learning Policies for the Vehicle Routing Problem.