Login / Signup

Efficient Exploration in Average-Reward Constrained Reinforcement Learning: Achieving Near-Optimal Regret With Posterior Sampling.

Danil ProvodinMaurits KapteinMykola Pechenizkiy
Published in: CoRR (2024)
Keyphrases