Study on meaningful string extraction algorithm for improving webpage classification.
Jie Chen, Jian Li, Hao Liao, Qingsheng Yuan, Xiuguo Bao
Published in: NLPKE (2011)
Keyphrases
- preprocessing
- classification algorithm
- computational complexity
- dynamic programming
- detection algorithm
- NP-hard
- learning algorithm
- classification method
- experimental study
- pattern recognition
- convergence rate
- computational cost
- classification scheme
- expectation maximization
- recognition algorithm
- benchmark data sets
- decision trees
- similarity measure
- optimal solution
- optimization algorithm
- support vector machine (SVM)
- probabilistic model
- machine learning
- accuracy rate
- cost function
- significant improvement
- hamming distance
- regular expressions
- multi-class classification
- simulated annealing
- worst case
- association rules
- support vector
- data structure