A Framework for Measuring Differences in Data Characteristics.
Venkatesh GantiJohannes GehrkeRaghu RamakrishnanWei-Yin LohPublished in: J. Comput. Syst. Sci. (2002)
Keyphrases
- data sets
- database
- data processing
- data collection
- prior knowledge
- data points
- statistical analysis
- data analysis
- knowledge discovery
- training data
- neural network
- multimedia data
- sensor data
- application domains
- experimental data
- high dimensional data
- heterogeneous sources
- historical data
- noisy data
- data distribution
- synthetic data
- data mining techniques
- xml documents
- training set