Safe Counterfactual Reinforcement Learning.

Yusuke Narita, Shota Yasui, Kohei Yata

Published in: CoRR (2020)

Keyphrases

reinforcement learning
function approximation
temporal difference
machine learning
Markov decision processes
learning algorithm
reinforcement learning algorithms
model-free
neural network
website
multi-agent
state space
relational reinforcement learning
real time
logical framework
dynamic programming
Markov decision process
multi-agent systems
function approximators
databases