Reinforcement Learning under Model Mismatch.

Aurko Roy Huan Xu Sebastian Pokutta

Published in: CoRR (2017)

Keyphrases

reinforcement learning
high level
statistical model
probabilistic model
experimental data
probability distribution
computational model
neural network model
neural network
mathematical model
process model
theoretical framework
optimal policy
prior knowledge
evolutionary algorithm
objective function
genetic algorithm
machine learning