On data skewness, stragglers, and MapReduce progress indicators.
Emilio Coppa, Irene Finocchi
Published in: CoRR (2015)
Keyphrases
- data sets
- data collection
- raw data
- data sources
- complex data
- data quality
- input data
- data points
- data analysis
- synthetic data
- original data
- statistical analysis
- computer systems
- data processing
- image data
- knowledge discovery
- high quality
- data distribution
- databases
- machine learning
- noisy data
- data objects
- data structure
- information systems
- temporal information
- test data
- sensor data
- missing data
- prior knowledge
- cloud computing
- decision trees