Research on Optimization of Data Balancing Partition Algorithm Based on Spark Platform.
Suzhen Wang, Zhiting Jia, Wenli Wang
Published in: ICAIS (2) (2021)
Keyphrases
- optimization algorithm
- data sets
- data collection
- noisy data
- detection algorithm
- segmentation algorithm
- optimization process
- synthetic datasets
- optimization method
- synthetic data
- input data
- incomplete data
- information loss
- k-means
- dynamic programming
- computational cost
- neural network
- knowledge discovery
- data analysis
- learning algorithm
- objective function
- expectation maximization
- search space
- metaheuristic
- prior information
- clustering algorithm
- similarity measure
- data structure
- computational complexity
- data points
- probability distribution
- database
- optimization model
- optimization criteria
- high dimensional data
- clustering method
- data mining techniques
- worst case
- data sources
- cost function
- preprocessing
- decision trees