Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models.
Christopher Schröder, Gerhard Heyer
Published in: CoRR (2024)
Keyphrases
- language model
- language modeling
- text classification
- n-gram
- active learning
- pre-trained
- language modelling
- document retrieval
- probabilistic model
- co-training
- speech recognition
- information retrieval
- retrieval model
- query expansion
- semi-supervised learning
- test collection
- labeled data
- training examples
- text categorization
- feature extraction
- feature selection
- smoothing methods
- semi-supervised
- training data
- neural network
- statistical language modeling
- language models for information retrieval
- relevance model
- unlabeled data
- data mining