SimProf: A Sampling Framework for Data Analytic Workloads.
Jen-Cheng HuangLifeng NaiPranith KumarHyojong KimHyesoon KimPublished in: IPDPS (2017)
Keyphrases
- data analysis
- data collection
- data processing
- complex data
- main contribution
- data points
- data sets
- raw data
- sensor data
- synthetic data
- input data
- small number
- knowledge discovery
- database
- high quality
- original data
- computer systems
- statistical analysis
- information systems
- spatial data
- sampled data
- data quality
- data mining
- data distribution
- experimental data
- data mining algorithms
- bayesian networks
- data structure
- probabilistic model
- probability distribution
- sensor networks