Min Max Generalization for Deterministic Batch Mode Reinforcement Learning: Relaxation Schemes.
Raphael FonteneauDamien ErnstBernard BoigelotQuentin LouveauxPublished in: SIAM J. Control. Optim. (2013)
Keyphrases
- min max
- batch mode
- reinforcement learning
- incremental learning
- batch mode active learning
- active learning
- control policy
- semi supervised
- learning algorithm
- supervised learning
- computationally expensive
- markov decision processes
- total cost
- labeled data
- multiple instance learning
- multi class
- feature vectors
- neural network