Data Partitioning Strategies for Simulating non-IID Data Distributions in the DDM-PS-Eval Evaluation Platform.
Mikolaj MarkiewiczJakub KoperwasPublished in: ICSOFT (2022)
Keyphrases
- data partitioning
- data distribution
- skyline computation
- parallel query processing
- high dimensional data
- data streams
- distributed data
- query processing
- database systems
- similarity search
- hierarchical clustering
- query execution
- data mining
- highly scalable
- index structure
- skyline queries
- main memory
- training data
- labeled data
- data skew
- data management