Reinforcement Learning under Model Mismatch.
Aurko Roy, Huan Xu, Sebastian Pokutta
Published in: CoRR (2017)
Keyphrases
- reinforcement learning
- high level
- statistical model
- probabilistic model
- experimental data
- probability distribution
- computational model
- neural network model
- neural network
- mathematical model
- process model
- theoretical framework
- optimal policy
- prior knowledge
- evolutionary algorithm
- objective function
- genetic algorithm
- machine learning