IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus.
Honghao GuiLin YuanHongbin YeNingyu ZhangMengshu SunLei LiangHuajun ChenPublished in: CoRR (2024)
Keyphrases
- information extraction
- open domain
- information extraction systems
- manually annotated
- relation extraction
- text mining
- precision and recall
- natural language processing
- databases
- natural language text
- linguistic patterns
- data model
- information retrieval
- free text
- small scale
- named entity recognition
- entity extraction
- question answering
- machine learning
- real world
- relational learning
- ontology based information extraction
- textual data
- web mining
- machine translation
- database schema
- xml schema
- text processing
- semistructured data
- named entities
- conditional random fields
- extraction patterns
- semi structured
- web scale
- probabilistic model
- real life
- search engine
- database