Co-Scheduling of Computation and Data on Computer Clusters.
Alexandru RomosanDoron RotemArie ShoshaniDerek WrightPublished in: SSDBM (2005)
Keyphrases
- data points
- data analysis
- data sets
- raw data
- data collection
- data structure
- input data
- data processing
- computer systems
- statistical analysis
- training data
- data distribution
- database
- synthetic data
- missing data
- data quality
- clustering analysis
- knowledge discovery
- probability distribution
- clustering algorithm
- information systems
- metadata
- spatial data
- data mining
- complex data
- input space
- learning algorithm
- original data
- statistical methods
- small number
- high quality
- data model
- data sources
- end users
- dimensionality reduction