Keyphrases
- synthetic data
- test data
- input data
- training data
- data distribution
- correlation analysis
- database
- data quality
- statistical methods
- dynamic programming
- information loss
- raw data
- high precision
- data sets
- detection method
- clustering method
- statistical analysis
- data processing
- data structure
- data sources
- high accuracy
- prior knowledge
- pairwise
- large scale data sets
- similarity measure
- noisy data
- data analysis
- cost function
- classification accuracy
- data points
- em algorithm
- mobile devices
- missing data
- prior information
- high quality
- statistical significance
- clustering algorithm
- semi supervised