CSF: An Efficient Parallel Deduplication Algorithm by Clustering Scattered Fingerprints.
Hao FanGuangping XuYi ZhangLiming YuanYanbing XuePublished in: ISPA/BDCloud/SocialCom/SustainCom (2019)
Keyphrases
- k means
- clustering method
- detection algorithm
- optimal solution
- computational complexity
- hierarchical clustering
- cost function
- experimental evaluation
- synthetic datasets
- similarity measure
- preprocessing
- significant improvement
- segmentation algorithm
- data clustering
- clustering analysis
- expectation maximization
- objective function
- learning algorithm
- cluster analysis
- depth first search
- distance metric
- hardware implementation
- parallel implementation
- outlier detection
- matching algorithm
- computationally efficient
- particle swarm optimization
- simulated annealing
- probabilistic model
- dynamic programming
- np hard
- clustering algorithm