Small-Text: Active Learning for Text Classification in Python.
Christopher SchröderLydia MüllerAndreas NieklerMartin PotthastPublished in: EACL (System Demonstrations) (2023)
Keyphrases
- text classification
- active learning
- text data
- text documents
- text mining
- text classifiers
- labeled data
- document categorization
- bag of words
- feature selection
- open source
- transfer learning
- text categorization
- information retrieval
- programming language
- machine learning
- naive bayes
- n gram
- learning strategies
- text representation
- random sampling
- unlabeled data
- small number
- textual data
- training corpus
- training examples
- semantic features
- text classification tasks
- experimental design
- document classification
- automatic text classification
- data cleaning
- text collections
- text retrieval
- multi label
- supervised learning
- semi supervised
- knn
- information theoretic
- open source software
- data integration
- semi supervised learning
- natural language processing
- relevance feedback
- classification accuracy
- selective sampling
- keywords
- decision trees
- learning algorithm