Geographical classification of documents using evidence from Wikipedia.
Rafael Odon de AlencarClodoveu A. Davis Jr.Marcos André GonçalvesPublished in: GIR (2010)
Keyphrases
- document collections
- pattern recognition
- document classification
- classification accuracy
- information retrieval
- feature extraction
- feature vectors
- feature selection
- machine learning
- document representation
- automatic categorization
- support vector machine svm
- document clustering
- text documents
- class labels
- text classification
- supervised learning
- knowledge base
- xml documents
- pre classified
- metadata
- document categorization
- semantic information
- document structure
- wikipedia articles
- automatic classification
- hedge detection
- vector space
- classification method
- retrieval systems
- web documents
- wordnet
- image classification
- training set
- support vector