CHisIEC: An Information Extraction Corpus for Ancient Chinese History.
Xuemei Tang, Qi Su, Jun Wang, Zekun Deng
Published in: LREC/COLING (2024)
Keyphrases
- information extraction
- open domain
- event extraction
- text summarization
- information extraction systems
- monolingual
- natural language text
- linguistic patterns
- natural language processing
- named entity recognition
- precision and recall
- free text
- question answering
- text mining
- writing style
- unknown words
- semi-structured
- web corpora
- named entities
- conditional random fields
- web mining
- entity extraction
- keyword extraction
- structured data
- machine learning
- relation extraction
- information retrieval
- ontology based information extraction
- extracting meaningful
- text documents
- machine translation
- text processing
- relational learning
- manually annotated
- chinese english
- textual data
- cultural heritage
- chinese text
- treebank
- data mining
- semantic roles
- test set
- data extraction
- coreference resolution
- traditional chinese medicine
- word segmentation