An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models.
Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip H. S. Torr
Published in: CoRR (2024)
Keyphrases
- learning models
- temporal difference
- supervised learning
- learning tasks
- semi supervised learning
- learning problems
- function approximation
- learning algorithm
- reinforcement learning
- machine learning
- td learning
- evaluation function
- unsupervised learning
- monte carlo
- unlabeled data
- semi supervised
- machine learning algorithms
- model free
- labeled data
- loss function
- active learning
- action selection
- training data
- conditional random fields
- classification models
- training set
- step size
- class labels
- linear combination
- transfer learning
- text categorization
- decision trees