An Efficient Active Learning Pipeline for Legal Text Classification.
Sepideh MamoolerRémi LebretStéphane MassonnetKarl AbererPublished in: CoRR (2022)
Keyphrases
- text classification
- active learning
- machine learning
- labeled data
- text categorization
- unlabeled data
- transfer learning
- n gram
- semi supervised
- learning strategies
- text data
- semantic features
- bag of words
- sentiment analysis
- feature selection
- data cleaning
- legal documents
- random sampling
- text classifiers
- naive bayes
- text mining
- knn
- text classification tasks
- learning algorithm
- information retrieval
- processing pipeline
- legal knowledge
- data sets
- case law
- supervised learning
- feature extraction
- knowledge base
- databases