Knowledge Distillation in Automated Annotation: Supervised Text Classification with LLM-Generated Training Labels.
Nicholas PangakisSamuel WolkenPublished in: CoRR (2024)
Keyphrases
- text classification
- supervised learning
- training set
- multi label
- feature selection
- knowledge base
- expert systems
- domain knowledge
- training examples
- knowledge acquisition
- subject matter experts
- text classifiers
- machine learning
- n gram
- data cleaning
- training phase
- neural network
- bag of words
- ground truth labels
- naive bayes
- labeled data
- unsupervised learning
- data mining techniques
- knowledge management
- knn
- pairwise
- learning algorithm
- data mining