Publication: Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions.