Regularized Soft Actor-Critic for Behavior Transfer Learning.
Mingxi TanAndong TianLudovic DenoyerPublished in: CoRR (2022)
Keyphrases
- transfer learning
- reinforcement learning
- actor critic
- learning tasks
- labeled data
- text categorization
- active learning
- policy gradient
- cross domain
- multi task
- machine learning
- transfer knowledge
- function approximation
- collaborative filtering
- semi supervised learning
- machine learning algorithms
- learning algorithm
- unlabeled data
- text classification
- text mining
- least squares
- average reward
- dimensionality reduction
- support vector machine
- state space
- target domain
- temporal difference
- dynamic programming
- neural network