Sequence Prediction with Unlabeled Data by Reward Function Learning.
Lijun Wu, Li Zhao, Tao Qin, Jianhuang Lai, Tie-Yan Liu
Published in: IJCAI (2017)
Keyphrases
- unlabeled data
- learning algorithm
- supervised learning
- active learning
- labeled and unlabeled data
- semi-supervised learning
- co-training
- reinforcement learning
- semi-supervised
- labeled data
- learning process
- sequence prediction
- class labels
- reward function
- learning tasks
- data points
- training data
- unsupervised learning
- learning problems
- decision-theoretic
- inverse reinforcement learning
- training set