Assessing Data Virtualization for Irregularly Replicated Large Datasets.
Bruno DinizDiego L. NogueiraAndré CardosoRenato FerreiraDorgival Olavo Guedes NetoWagner Meira Jr.Published in: CCGRID (2006)
Keyphrases
- data sets
- database
- training data
- data structure
- data processing
- synthetic data
- data points
- data quality
- data analysis
- raw data
- data sources
- image data
- data collection
- statistical analysis
- experimental conditions
- data mining algorithms
- original data
- learning algorithm
- fault tolerant
- missing values
- test data
- data distribution
- clustering algorithm
- probability distribution
- prior knowledge