Web Document Classification by Keywords Using Random Forests.
Myungsook Klassen, Nikhila Paturi
Published in: NDT (2) (2010)
Keyphrases
- random forests
- web document classification
- keywords
- document classification
- probabilistic neural network
- random forest
- text documents
- knn
- probabilistic relational models
- decision trees
- machine learning algorithms
- ensemble methods
- logistic regression
- web documents
- search engine
- decision tree ensembles
- k nearest neighbor
- text categorization
- text mining
- web pages
- benchmark datasets
- text classification
- support vector
- data sets