Unknown malcode detection via text categorization and the imbalance problem.
Robert MoskovitchDima StopelClint FeherNir NissimYuval EloviciPublished in: ISI (2008)
Keyphrases
- text categorization
- text classification
- text documents
- feature selection
- reuters corpus
- information gain
- knn
- automated text categorization
- text classifiers
- semi supervised learning
- multi label
- term frequency
- document categorization
- tf idf
- k nearest neighbor
- automatic text categorization
- text collections
- feature weighting
- document classification
- naive bayes
- term weighting
- document frequency
- semantic browsing
- feature selection for text categorization