What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks.
Xingwu Chen, Difan Zou
Published in: CoRR (2024)
Keyphrases
- learning tasks
- case study
- meta learning
- learning problems
- transfer learning
- learning algorithm
- machine learning
- learning experience
- learning agent
- multi task
- previously learned
- supervised learning
- machine learning algorithms
- hypothesis space
- multi label
- function approximation
- kernel methods
- multitask learning
- multi task learning
- function approximators
- kernel based learning methods
- real world
- labeled and unlabeled data