Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward.
Baihan LinPublished in: CoRR (2020)
Keyphrases
- semi supervised learning
- semi supervised
- unlabeled data
- labeled data
- semi supervised classification
- supervised learning
- co training
- unsupervised learning
- manifold regularization
- labeled examples
- reinforcement learning
- machine learning
- learning problems
- training data
- label propagation
- semi supervised learning algorithms
- online learning
- graph based semi supervised learning
- transfer learning
- semi supervised learning methods
- active learning
- graph construction
- multi armed bandit
- domain adaptation
- learning models
- regularization framework
- unlabeled samples
- labeled and unlabeled data
- metric learning
- training examples
- learning algorithm
- information retrieval