Don't cry over spilled records: Memory elasticity of data-parallel applications and its application to cluster scheduling.
Calin IorgulescuFlorin DinuAunn RazaWajih Ul HassanWilly ZwaenepoelPublished in: CoRR (2017)
Keyphrases
- data sets
- data points
- image data
- database
- synthetic data
- original data
- data distribution
- attribute values
- data processing
- knowledge discovery
- high quality
- training data
- data analysis
- data objects
- data transfer
- data records
- data sources
- data collection
- resource allocation
- raw data
- data mining
- computational power
- memory space
- memory footprint