Learning and Planning for Time-Varying MDPs Using Maximum Likelihood Estimation.

Melkior Ornik, Ufuk Topcu

Published in: CoRR (2019)

Keyphrases

maximum likelihood estimation
reinforcement learning
Boltzmann machine
maximum likelihood
learning algorithm
prior knowledge
stochastic domains
machine learning
k-means
supervised learning
parameter estimation
planning problems