Wikipedia-Based Document Categorization.
Krzysztof CiesielskiPiotr BorkowskiMieczyslaw A. KlopotekKrzysztof TrojanowskiKamil WysockiPublished in: SIIS (2011)
Keyphrases
- document categorization
- document representation
- text categorization
- text classification
- meta learning
- text documents
- wordnet
- document collections
- document clustering
- named entities
- vector space model
- entity identification
- document classification
- wikipedia articles
- latent semantic indexing
- semantic relations
- inductive learning
- semantic information
- knowledge base
- bag of words
- text mining
- knn
- text corpus
- decision trees