Fast data-oriented microaggregation algorithm for large numerical datasets.
Reza MortazaviSaeed JaliliPublished in: Knowl. Based Syst. (2014)
Keyphrases
- data sets
- input data
- synthetic datasets
- dimensional data
- information loss
- database
- raw data
- data collection
- data points
- learning algorithm
- data quality
- original data
- preprocessing
- detection algorithm
- data reduction
- objective function
- noisy data
- data structure
- k means
- search space
- computational complexity
- clustering method
- numerical data
- image data
- probabilistic model
- optimal solution
- synthetic and real datasets
- sampling methods
- expectation maximization
- training data
- data distribution
- cost function
- genetic algorithm
- classification trees
- high dimensional datasets
- feature space
- decision trees
- feature subset
- matching algorithm
- high dimensional data
- data analysis
- data mining techniques
- simulated annealing