An improved two-step algorithm for task and data parallel scheduling in distributed memory machines.
Savina Bansal, Padam Kumar, Kuldip Singh
Published in: Parallel Comput. (2006)
Keyphrases
- input data
- parallel implementation
- data sets
- noisy data
- objective function
- optimization algorithm
- data collection
- segmentation algorithm
- knowledge discovery
- detection algorithm
- matching algorithm
- learning algorithm
- data quality
- preprocessing
- database
- multiprocessor systems
- training data
- original data
- data distribution
- data reduction
- high dimensional data
- distributed memory machines
- dynamic programming
- data structure
- computational complexity
- data analysis
- computational cost
- data sources
- worst case
- simulated annealing
- clustering method
- synthetic data
- missing data
- probabilistic model
- similarity measure
- cost function
- spectral clustering
- k-means
- search space
- feature space
- synthetic datasets
- parallel processors
- expectation maximization