Modular Data Clustering - Algorithm Design beyond MapReduce.
Martin HahmannDirk HabichWolfgang LehnerPublished in: EDBT/ICDT Workshops (2014)
Keyphrases
- data sets
- clustering algorithm
- data processing
- data quality
- data points
- data collection
- data analysis
- raw data
- training data
- xml documents
- data model
- image data
- synthetic data
- high dimensional data
- input data
- end users
- spectral clustering
- database
- neural network
- data distribution
- data integrity
- hierarchical clustering algorithm
- experimental data
- data mining techniques
- knowledge discovery
- data sources
- prior knowledge
- user interface
- high quality