Prospective Validation of Text Categorization Filters for Identifying High-Quality, Content-Specific Articles in MEDLINE.
Yindalon Aphinyanaphongs, Constantin F. Aliferis
Published in: AMIA (2006)
Keyphrases
- text categorization
- text classification
- feature selection
- k nearest neighbor
- information gain
- knn
- multi label
- naive bayes
- reuters corpus
- automated text categorization
- text mining
- text documents
- feature weighting
- term weighting
- multi instance multi label learning
- feature selection for text categorization
- machine learning
- unlabeled data
- term frequency
- text classifiers
- semi supervised learning
- document frequency
- training documents
- pairwise
- semantic browsing