Keyphrases
- input data
- data sets
- high quality
- training data
- noisy data
- database
- incomplete data
- data analysis
- preprocessing
- optimization algorithm
- single scan
- synthetic data
- detection algorithm
- data collection
- data mining techniques
- image data
- dynamic programming
- computational complexity
- learning algorithm
- similarity measure
- optimal solution
- knowledge discovery
- k-means
- cost function
- probabilistic model
- XML documents
- high-dimensional data
- matching algorithm
- prior information
- spectral clustering
- data sources
- high accuracy
- information loss
- data reduction