Column-Based Partitioning for Data in High Dimensional Space.
Ekasit KijsipongseSudsanguan NgamsuriyarojPublished in: ICPP (2007)
Keyphrases
- data sets
- data quality
- historical data
- database
- raw data
- statistical analysis
- data collection
- data analysis
- end users
- image data
- original data
- data processing
- high quality
- databases
- complex data
- data sources
- big data
- neural network
- noisy data
- data distribution
- training data
- social networks
- temporal information
- domain experts
- high dimensional data
- small number
- knowledge discovery
- data points
- probability distribution
- prior knowledge
- training set