Information extraction across textual corpora: semi-automatic text-tagging workflow with Chinese local gazetteers.
Calvin Yeh, Sean Wang, Shih-Pei Chen
Published in: DH (2020)
Keyphrases
- automatic text
- information extraction
- cross document
- natural language processing
- named entity recognition
- text summarization
- named entities
- textual data
- free text
- event extraction
- multi document summarization
- part of speech
- metadata
- natural language
- text mining
- relation extraction
- pos tagging
- information retrieval
- text documents
- open domain
- structured data
- coreference resolution
- question answering
- web documents
- machine learning
- semi structured
- conditional random fields
- text data
- machine translation
- word segmentation
- keywords
- controlled vocabulary
- unknown words
- spatial information
- text classification
- co occurrence
- databases