On data skewness, stragglers, and MapReduce progress indicators.
Emilio CoppaIrene FinocchiPublished in: SoCC (2015)
Keyphrases
- database
- high quality
- training data
- data sources
- application domains
- statistical analysis
- data collection
- data sets
- data quality
- experimental data
- data points
- sensor data
- small number
- original data
- raw data
- image data
- end users
- data streams
- input data
- missing data
- knowledge discovery
- probability distribution
- data structure
- data objects
- metadata
- standard deviation
- complex data