k-Information Gain Scaled Nearest Neighbors: A Novel Approach to Classifying Protein-Protein Interaction-Related Documents.
Kyle H. Ambert, Aaron M. Cohen
Published in: IEEE ACM Trans. Comput. Biol. Bioinform. (2012)
Keyphrases
- information gain
- protein protein interactions
- related documents
- nearest neighbor
- text categorization
- computational methods
- k nearest neighbor
- knn
- decision trees
- high throughput
- mutual information
- semantic similarity
- feature selection
- protein interaction
- document collections
- information retrieval systems
- information retrieval
- high dimensional data
- data points
- document clustering
- training set
- keywords
- relevant documents
- correlation coefficient
- similarity measure
- user queries
- web documents
- document retrieval
- microarray
- data sets
- active learning
- document representation
- high dimensional
- database systems
- data mining