A web page distillation strategy for efficient focused crawling based on optimized Naïve bayes (ONB) classifier.
Ahmed I. SalehArwa E. AbulwafaMohammed F. Al RahmawyPublished in: Appl. Soft Comput. (2017)
Keyphrases
- tree augmented
- bayes classifiers
- bayes classifier
- bayesian classifiers
- focused crawling
- naive bayes classifier
- web pages
- bayesian network classifiers
- bayesian networks
- decision trees
- web documents
- bayesian classifier
- feature selection
- topic specific
- support vector
- website
- rule learner
- web users
- web search engines
- text classification
- search engine
- text classifiers
- naive bayes
- generative model
- domain knowledge
- bayes net