A negative category based approach for Wikipedia document classification.
Meenakshi Sundaram Murugeshan, K. Lakshmi, Saswati Mukherjee
Published in: Int. J. Knowl. Eng. Data Min. (2010)
Keyphrases
- document classification
- natural language text
- text categorization
- text mining
- text classification
- text documents
- web documents
- classification algorithm
- topic extraction
- Wikipedia categories
- web document classification
- document collections
- automatic document classification
- WordNet
- linear classification
- named entities
- support vector machine
- knowledge representation
- feature selection
- data sets