Using Zero-Shot Transfer to Initialize azWikiNER, a Gold Standard Named Entity Corpus for the Azerbaijani Language.
Kamran IbiyevAttila NovákPublished in: TDS (2021)
Keyphrases
- gold standard
- named entities
- annotated corpus
- person names
- linguistic features
- named entity recognition
- semi automatic
- ground truth
- noun phrases
- named entity disambiguation
- relation extraction
- named entity extraction
- information extraction
- genia corpus
- news corpus
- co occurrence
- natural language processing
- question answering
- text mining
- contextual features
- linguistic knowledge
- natural language
- automatic annotation
- unsupervised learning
- mechanical turk
- real world
- conditional random fields
- coreference resolution
- semantic role labeling
- proper names
- high quality
- feature selection
- domain specific
- machine learning
- text documents