Hindsight Learning for MDPs with Exogenous Inputs.

Sean R. Sinclair, Felipe Frujeri, Ching-An Cheng, Adith Swaminathan

Published in: CoRR (2022)

Keyphrases

learning algorithm
reinforcement learning
learning process
prior knowledge
neural network
online learning
learning systems
data sets
least squares
supervised learning
unsupervised learning
action selection