BIOfid Dataset: Publishing a German Gold Standard for Named Entity Recognition in Historical Biodiversity Literature.
Sajawel AhmedManuel StoeckelChristine DrillerAdrian PachzeltAlexander MehlerPublished in: CoNLL (2019)
Keyphrases
- gold standard
- named entity recognition
- information extraction
- named entities
- natural language processing
- ground truth
- semi automatic
- text summarization
- maximum entropy
- conditional random fields
- mechanical turk
- sequence labeling
- semi supervised
- annotated corpus
- relation extraction
- information retrieval
- benchmark datasets
- proper names
- chinese named entity recognition
- feature set
- semantic relations
- unsupervised learning
- graphical models
- co occurrence
- text mining
- natural language
- maximum entropy classifier
- high quality
- named entity disambiguation
- learning algorithm