Comparing Annotated Datasets for Named Entity Recognition in English Literature.
Rositsa Ivanova, Marieke van Erp, Sabrina Kirrane
Published in: LREC (2022)
Keyphrases
- named entity recognition
- annotated corpus
- proper names
- natural language processing
- named entities
- information extraction
- text summarization
- named entity recognizer
- linguistic features
- semi-supervised
- maximum entropy
- conditional random fields
- relation extraction
- machine translation
- natural language
- GENIA corpus
- text mining
- automatic annotation
- classifier ensemble
- WordNet
- co-occurrence
- databases
- benchmark datasets
- query expansion
- unsupervised learning
- POS tagging
- active learning
- training data
- maximum entropy classifier
- data sets