Web document clustering based on a new niching Memetic Algorithm, Term-Document Matrix and Bayesian Information Criterion.
Carlos Alberto Cobos Lozada, Claudia Montealegre, Maria-Fernanda Mejia, Martha Mendoza, Elizabeth León
Published in: IEEE Congress on Evolutionary Computation (2010)
Keyphrases
- fitness function
- memetic algorithm
- web documents
- bayesian information criterion
- crossover operator
- evolutionary computation
- genetic algorithm
- vector space model
- model selection
- latent semantic indexing
- mixture model
- document representation
- gaussian mixture model
- information extraction
- tabu search
- web pages
- selection criterion
- keywords
- information retrieval
- maximum likelihood
- simulated annealing
- metaheuristic
- clustering algorithm