Using Large Language Model Annotations for Valid Downstream Statistical Inference in Social Science: Design-Based Semi-Supervised Learning.
Naoki Egami, Musashi Jacobs-Harukawa, Brandon M. Stewart, Hanying Wei
Published in: CoRR (2023)
Keyphrases
- semi-supervised learning
- language model
- social sciences
- statistical inference
- semi-supervised
- labeled data
- machine learning
- supervised learning
- language modeling
- unlabeled data
- n-gram
- active learning
- statistical learning
- training data
- information retrieval
- unsupervised learning
- co-training
- transfer learning
- label propagation
- probabilistic model
- text classification
- retrieval model
- statistical methods
- bayesian inference
- computer science
- query expansion
- graphical models
- pairwise
- decision trees
- feature selection