Uncertainty-Based Noise Reduction and Term Selection in Text Categorization.
Charles M. E. E. PetersCornelis H. A. KosterPublished in: ECIR (2002)
Keyphrases
- noise reduction
- term selection
- text categorization
- signal to noise ratio
- edge detection
- knn
- document frequency
- feature selection
- naive bayes
- information gain
- text classification
- multi label
- k nearest neighbor
- term frequency
- query expansion
- text documents
- tf idf
- multiscale
- semi supervised learning
- information retrieval