Structure-based classification of web documents using Support Vector Machine.
Kejing He, Chenyang Li
Published in: CCIS (2016)
Keyphrases
- web documents
- support vector machine
- document classification
- classification method
- support vector
- svm classifier
- semi-structured
- classification algorithm
- web pages
- feature vectors
- feature selection
- web search engines
- html documents
- information extraction
- feature space
- keywords
- machine learning
- web content
- web data
- automatic classification
- decision boundary
- multi-class
- training set
- training data
- decision trees
- text classification
- support vector machine (SVM)
- image classification
- kernel methods
- vector space model
- supervised learning
- classify documents
- text mining
- data model
- unstructured documents