Dslash: Managing Data in Overloaded Batch Streaming Systems.
Robert BirkeMathias BjörkqvistEvangelia KalyvianakiLydia Y. ChenPublished in: ICDCS (2016)
Keyphrases
- data sets
- small number
- raw data
- data analysis
- prior knowledge
- statistical analysis
- database
- original data
- data collection
- image data
- data quality
- synthetic data
- computer systems
- distributed systems
- management system
- data sources
- machine learning
- high quality
- training data
- website
- storage systems
- complex data
- learning algorithm
- metadata
- network structure
- data acquisition
- data distribution
- knowledge discovery
- high dimensional data
- decision trees
- data structure
- expert systems
- probability distribution