Document controversy classification based on the Wikipedia category structure.
Michal Jankowski-LorekKazimierz ZielinskiPublished in: Comput. Sci. (2015)
Keyphrases
- document classification
- pattern recognition
- classify documents
- classification accuracy
- classification algorithm
- image classification
- hierarchical structure
- logical structure
- document collections
- text classification
- category labels
- automatic classification
- support vector machine
- feature vectors
- feature space
- document structure
- document clustering
- information retrieval
- classification method
- feature extraction
- machine learning
- text documents
- class labels
- wikipedia articles
- supervised learning
- training set
- training documents
- wikipedia pages
- wikipedia categories