Refined Value-Based Offline RL under Realizability and Partial Coverage.

Masatoshi Uehara Nathan Kallus Jason D. Lee Wen Sun

Published in: CoRR (2023)

Keyphrases

reinforcement learning
real time
optimal policy
function approximation
database
machine learning
learning algorithm
model free
databases
supervised learning