A Similarity-Aware Multiversion Concurrency Control and Updating Algorithm for Up-to-Date Snapshots of Data.
Thomas GustafssonHugo HallqvistJörgen HanssonPublished in: ECRTS (2005)
Keyphrases
- input data
- data sets
- noisy data
- detection algorithm
- similarity measure
- data collection
- database
- similarity matrix
- data reduction
- computational cost
- data mining techniques
- incremental algorithms
- expectation maximization
- segmentation algorithm
- optimization algorithm
- synthetic data
- data objects
- information loss
- optimal solution
- knowledge discovery
- synthetic datasets
- objective function
- similarity function
- data structure
- spectral clustering
- data distribution
- missing data
- preprocessing
- k means
- probabilistic model
- np hard
- learning algorithm
- multidimensional scaling
- training data
- similarity metric
- hamming distance
- image data
- dynamic programming
- high dimensional data
- cost function
- data streams
- original data
- search space
- clustering method
- distance function