Data-Aware Partitioning Schema in MapReduce.
Junjie Liang, Qiongni Liu, Li Yin, Dunhui Yu
Published in: ICYCSEE (2015)
Keyphrases
- data sets
- data collection
- synthetic data
- data analysis
- small number
- data sources
- high quality
- data structure
- image data
- statistical analysis
- knowledge discovery
- probability distribution
- missing data
- original data
- sensor data
- data objects
- data quality
- high dimensional data
- data processing
- prior knowledge
- training data
- decision trees
- neural network
- databases