Data distribution tailoring revisited: cost-efficient integration of representative data.
Jiwon ChangBohan CuiFatemeh NargesianAbolfazl AsudehH. V. JagadishPublished in: VLDB J. (2024)
Keyphrases
- data distribution
- cost efficient
- data points
- index structure
- data sets
- distributed data
- data streams
- high dimensional data
- streaming data
- data analysis
- image data
- multi dimensional data
- training instances
- database
- input data
- pattern recognition
- communication cost
- data skew
- management system
- high dimensional
- data structure
- training data
- machine learning
- real world
- neural network