Recognition of protein/gene names from text using an ensemble of classifiers.
Guodong ZhouDan ShenJie ZhangJian SuSoon-Heng TanPublished in: BMC Bioinform. (2005)
Keyphrases
- ensemble learning
- training data
- document analysis
- multiple classifiers
- training set
- ensemble classifier
- ensemble pruning
- classifier ensemble
- lexical features
- multiple classifier systems
- protein protein interaction networks
- feature selection
- sequence alignment
- recognition rate
- individual classifiers
- keywords
- biomedical literature
- protein interaction
- protein sequences
- regulatory networks
- text mining
- feature extraction
- cellular processes
- gene prediction
- medline abstracts
- biological entities
- homo sapiens
- decision trees
- majority voting
- ensemble methods
- naive bayes
- microarray
- gene expression
- feature ranking
- support vector
- protein structure
- combining classifiers
- machine learning
- weak classifiers
- accurate classifiers
- gene expression data
- dna sequences
- weak learners
- protein structure prediction
- handwritten text
- multi class
- amino acids
- learning algorithm
- text lines