Login / Signup
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning.
Christoph Dann
Mehryar Mohri
Tong Zhang
Julian Zimmert
Published in:
CoRR (2022)
Keyphrases
</>
model free
reinforcement learning
function approximation
reinforcement learning algorithms
policy iteration
temporal difference
learning process
reinforcement learning methods
machine learning
state space
average reward
multi agent
least squares
policy evaluation
impedance control