Publication: Combinations and Mixtures of Optimal Policies in Unichain Markov Decision Processes are Optimal