Keyphrases
- data sets
- raw data
- data analysis
- data quality
- database
- application domains
- synthetic data
- data points
- data structure
- original data
- prior knowledge
- data sources
- end users
- probability distribution
- XML documents
- knowledge discovery
- data collection
- computer systems
- statistical analysis
- missing data
- high quality
- test data
- training data
- learning algorithm