MapReduce: Simplified Data Processing on Large Clusters.
Jeffrey Dean, Sanjay Ghemawat
Published in: OSDI (2004)
Keyphrases
- data processing
- data analysis
- computer systems
- clustering algorithm
- cloud computing
- cluster analysis
- hierarchical clustering
- data management
- data acquisition
- stream processing
- document clustering
- parallel computing
- unsupervised clustering
- data points
- decision trees
- parallel processing
- database
- high performance data mining
- fuzzy c means
- fuzzy clustering
- data clustering
- hierarchical structure
- data distribution