GENETAG: a tagged corpus for gene/protein named entity recognition.
Lorraine K. Tanabe, Natalie Xie, Lynne H. Thom, Wayne Matten, W. John Wilbur
Published in: BMC Bioinform. (2005)
Keyphrases
- named entity recognition
- annotated corpus
- named entities
- sequence alignment
- information extraction
- natural language processing
- linguistic features
- GENIA corpus
- named entity disambiguation
- maximum entropy
- reference resolution
- protein interaction
- text summarization
- genomic sequences
- conditional random fields
- semi-supervised
- protein structure
- POS tagging
- relation extraction
- protein sequences
- gene expression
- microarray
- amino acids
- classifier ensemble
- pairwise
- data mining
- noun phrases
- data sets
- text mining
- biomedical literature
- automatic annotation
- binding sites
- protein-protein interactions
- question answering
- model selection