A Distributed Algorithm for Determining the Provenance of Data.
Paul T. Groth
Published in: eScience (2008)
Keyphrases
- data sets
- input data
- learning algorithm
- noisy data
- distributed data
- preprocessing
- synthetic data
- detection algorithm
- data collection
- dynamic programming
- database
- data quality
- data mining techniques
- original data
- worst case
- data sources
- NP-hard
- cost function
- search space
- training data
- data structure
- image data
- computational cost
- segmentation algorithm
- optimization algorithm
- data transfer
- data distribution
- objective function
- data analysis
- single scan
- heterogeneous data
- multimedia data
- lower bound
- high dimensional data
- clustering method
- simulated annealing