Login / Signup

A Sublinear-Regret Reinforcement Learning Algorithm on Constrained Markov Decision Processes with reset action.

Takashi WatanabeTakashi Sakuragawa
Published in: ICMLSC (2020)
Keyphrases