Keyphrases
- synthetic data
- input data
- missing data
- data sets
- test data
- cost function
- similarity measure
- training data
- training samples
- correlation analysis
- user input
- noisy data
- statistical methods
- missing values
- database
- significant improvement
- feature set
- data structure
- data analysis
- xml documents
- knowledge discovery
- data collection
- parameter estimation
- clustering method
- detection method
- decision trees
- probabilistic model
- segmentation method
- data sources
- spectral clustering
- pairwise
- data quality
- support vector machine
- high accuracy