A Fast Document Classification Algorithm for Gene Symbol Disambiguation in the BITOLA Literature-Based Discovery Support System.
Andrej KastrinDimitar HristovskiPublished in: AMIA (2008)
Keyphrases
- classification algorithm
- document classification
- knn
- k nearest neighbor
- support vector machine
- information retrieval
- accurate classification
- training phase
- concept drift
- naive bayes
- biomedical literature
- learning algorithm
- training set
- co occurrence
- text classification
- class labels
- microarray
- database
- knowledge discovery
- natural language
- classification rules
- information retrieval systems
- natural language processing
- classification method
- data streams
- knowledge representation
- input features
- nearest neighbor