Refined Value-Based Offline RL under Realizability and Partial Coverage.
Masatoshi Uehara
Nathan Kallus
Jason D. Lee
Wen Sun
Published in:
CoRR (2023)
Keyphrases
reinforcement learning
real time
optimal policy
function approximation
database
machine learning
learning algorithm
model free
databases
supervised learning