Keyphrases
- data sets
- high dimensional data
- missing values
- data sources
- statistical methods
- high quality
- small number
- data processing
- synthetic data
- historical data
- probability distribution
- data mining techniques
- data points
- data collection
- data structure
- data quality
- spectral clustering
- noisy data
- data distribution
- data mining methods
- prior knowledge
- statistical significance
- predictive model
- data mining applications
- statistical tests
- data representations
- original data
- human experts
- machine learning methods
- experimental data
- statistical analysis
- computer systems
- image data
- computational cost
- xml documents
- decision trees