An Efficient Webpage Classification Algorithm Based on LSH.
Junjun LiuHaichun SunZhijun DingPublished in: ICYCSEE (2015)
Keyphrases
- information retrieval
- classification algorithm
- search engine
- knn
- k nearest neighbor
- document classification
- hierarchical classification
- training phase
- training set
- naive bayes
- classification method
- class labels
- support vector machine
- classification rules
- web pages
- locality sensitive hashing
- accurate classification
- nearest neighbor
- learning algorithm
- input features
- databases
- similarity search
- concept drift
- decision trees
- classifier ensemble
- text categorization
- data analysis
- object recognition