Data Deduplication Using Dynamic Chunking Algorithm.
Young Chan MoonHo Min JungChuck YooYoung Woong KoPublished in: ICCCI (2) (2012)
Keyphrases
- data sets
- data processing
- learning algorithm
- input data
- computational complexity
- image data
- data collection
- search space
- preprocessing
- dynamic programming
- detection algorithm
- database
- segmentation algorithm
- data reduction
- cost function
- data sources
- synthetic data
- noisy data
- worst case
- original data
- data cleaning
- optimization algorithm
- expectation maximization
- search algorithm
- synthetic datasets
- data quality
- recognition algorithm
- incomplete data
- information loss
- training data
- dynamic environments
- data mining
- high quality
- data mining techniques
- knowledge discovery
- data points
- probability distribution
- probabilistic model
- np hard
- k means
- objective function
- data structure