Publication: No-Regret Reinforcement Learning with Value Function Approximation: a Kernel Embedding Approach.