Approximate Partition Selection for Big-Data Workloads using Summary Statistics.
Kexin RongYao LuPeter BailisSrikanth KandulaPhilip Alexander LevisPublished in: CoRR (2020)
Keyphrases
- big data
- summary statistics
- data processing
- cloud computing
- high volume
- vast amounts of data
- data analysis
- big data analytics
- unstructured data
- data intensive
- data management
- social media
- business intelligence
- massive data
- data sets
- knowledge discovery
- social computing
- massive datasets
- data driven decision making
- database systems
- real world
- database management systems
- information extraction
- data science
- metadata
- feature selection
- data intensive computing