Genetic Algorithm Based Parallel K-Means Data Clustering Algorithm Using MapReduce Programming Paradigm on Hadoop Environment (GAPKCA).
Sayer AlshammariMaslina Binti ZolkepliRusli Bin AbdullahPublished in: SCDM (2020)
Keyphrases
- data sets
- clustering algorithm
- k means
- training data
- database
- clustering analysis
- data mining techniques
- data sources
- data analysis
- computer systems
- document clustering
- spectral clustering
- big data
- xml documents
- open source
- distributed systems
- input data
- cloud computing
- clustering method
- data clustering
- data reduction
- data structure