Optimization and analysis of large scale data sorting algorithm based on Hadoop.
Zhuo WangLonglong TianDianjie GuoXiaoming JiangPublished in: CoRR (2015)
Keyphrases
- optimization algorithm
- noisy data
- input data
- data analysis
- np hard
- optimization process
- statistical analysis
- data sets
- objective function
- massive scale
- contingency tables
- data processing
- probabilistic model
- data sources
- learning algorithm
- big data
- detection algorithm
- data structure
- missing data
- optimization method
- computational complexity
- preprocessing
- simulated annealing
- data points
- stochastic gradient
- database
- dynamic programming
- data reduction
- training data
- combinatorial optimization
- optimization model
- data collection
- particle swarm optimization
- linear programming
- data mining techniques
- cost function
- optimal solution
- query processing
- data mining
- search space
- synthetic datasets
- optimization problems
- constrained optimization
- expectation maximization
- data distribution
- association rule mining
- cloud computing
- segmentation algorithm