A large reproducible benchmark on text classification for the legal domain based on the ECHR-OD repository.
Alexandre Quemy, Robert Wrembel, Natalia Lopuszynska, George Papadakis, Agustín D. Delgado
Published in: Inf. Syst. (2023)
Keyphrases
- text classification
- text mining
- feature selection
- naive bayes
- machine learning
- digital libraries
- legal knowledge
- semantic features
- n-gram
- conceptual retrieval
- travel time
- domain experts
- text categorization
- domain independent
- cross domain
- neural network
- text classifiers
- data cleaning
- active learning
- artificial intelligence
- legal ontologies