RealKIE: Five Novel Datasets for Enterprise Key Information Extraction.
Benjamin TownsendMadison MayChristopher WellsPublished in: CoRR (2024)
Keyphrases
- information extraction
- machine learning
- decision trees
- natural language processing
- information management
- precision and recall
- information systems
- case study
- synthetic and real datasets
- information technology
- textual data
- data sets
- free text
- benchmark datasets
- semi structured
- uci machine learning repository
- information retrieval
- text mining
- active learning
- training data
- artificial intelligence