Modeling Website Topic Cohesion at Scale to Improve Webpage Classification.
Dhivya Eswaran, Paul N. Bennett, Joseph J. Pfeiffer III
Published in: SIGIR (2015)
Keyphrases
- website
- pattern recognition
- classification accuracy
- web pages
- feature extraction
- automatic classification
- neural network
- classification process
- classification scheme
- text classification
- classification algorithm
- feature vectors
- search engine
- machine learning
- classification systems
- benchmark datasets
- improve the classification accuracy
- support vector machine
- training set
- feature selection
- unsupervised learning
- image classification
- class labels
- svm classifier
- supervised learning
- classification method
- machine learning methods
- preprocessing
- learning algorithm