Model-Free and Model-Based Policy Evaluation when Causality is Uncertain.

David Bruns-Smith

Published in: ICML (2021)

Keyphrases

model free
policy evaluation
reinforcement learning
policy iteration
temporal difference
reinforcement learning algorithms
function approximation
neural network
feature selection
decision making
machine learning
training set
support vector machine
least squares
optimal control