Publication: Finding optimal memoryless policies of POMDPs under the expected average reward criterion.