Smart Crawler: Using Committee Machines for Web Pages Continuous Classification.
Luiz Henrique Zambom Santana, Ronaldo dos Santos Mello, Mauro Roisenberg
Published in: WebMedia (2015)
Keyphrases
- web pages
- website
- web page classification
- search engine
- image classification
- text classification
- pattern recognition
- feature extraction
- support vector machine svm
- machine learning
- web search
- classification systems
- pattern classification
- class labels
- classification method
- classification scheme
- supervised learning
- support vector machine
- classification accuracy
- training set
- feature selection
- feature vectors
- training data
- decision trees
- web browser
- google search engine
- page segmentation
- web crawlers
- topic specific
- neural network
- web users
- web content
- classification algorithm
- web search engines
- web documents
- training samples
- model selection
- feature space