Login / Signup

Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation.

Jianliang HeHan ZhongZhuoran Yang
Published in: CoRR (2024)
Keyphrases