Efficient Data Redistribution to Speedup Big Data Analytics in Large Systems.
Long Cheng, Tao Li
Published in: HiPC (2016)
Keyphrases
- data sets
- database
- data analysis
- data collection
- data processing
- knowledge discovery
- raw data
- computer systems
- storage systems
- big data
- synthetic data
- data points
- training data
- databases
- image data
- statistical analysis
- high dimensional data
- real world
- data mining techniques
- data sources
- query processing
- prior knowledge
- sensor data
- decision making
- original data
- data quality
- data retrieval