The CALBC Silver Standard Corpus for Biomedical Named Entities - A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers.
Dietrich Rebholz-SchuhmannAntonio José Jimeno-YepesErik M. van MulligenNing KangJan A. KorsDavid MilwardPeter T. CorbettEkaterina BuykoKatrin TomanekElena BeisswangerUdo HahnPublished in: LREC (2010)
Keyphrases
- named entities
- genia corpus
- information extraction
- text mining
- annotated corpus
- named entity recognition
- linguistic features
- named entity extraction
- co occurrence
- natural language processing
- noun phrases
- person names
- relation extraction
- question answering
- news corpus
- text documents
- text corpus
- chinese named entity recognition
- maximum entropy model
- named entity disambiguation
- unsupervised learning
- weakly supervised
- graphical models
- contextual features
- global context
- coreference resolution
- proper names
- information retrieval
- wordnet
- personal names
- domain specific
- natural language