Web document clustering based on Global-Best Harmony Search, K-means, Frequent Term Sets and Bayesian Information Criterion.
Carlos Alberto Cobos LozadaJennifer AndradeWilliam ConstainMartha MendozaElizabeth LeónPublished in: IEEE Congress on Evolutionary Computation (2010)
Keyphrases
- web documents
- bayesian information criterion
- harmony search
- k means
- document representation
- harmony search algorithm
- web pages
- metaheuristic
- information criterion
- genetic algorithm
- simulated annealing algorithm
- model selection
- information extraction
- web logs
- keywords
- gaussian mixture model
- differential evolution
- hill climbing
- clustering algorithm
- mixture model
- prefetching
- cross validation
- bp neural network
- mutation operator
- response time
- dynamically generated
- data sets