A Scalable Parallel Deduplication Algorithm.
Walter Santos, Thiago Teixeira, Carla Machado, Wagner Meira Jr., Renato Ferreira, Dorgival Olavo Guedes Neto, Altigran Soares da Silva
Published in: SBAC-PAD (2007)
Keyphrases
- learning algorithm
- parallel implementation
- computationally efficient
- single pass
- computational complexity
- improved algorithm
- k-means
- worst case
- NP-hard
- detection algorithm
- times faster
- experimental evaluation
- dynamic programming
- computational cost
- theoretical analysis
- high accuracy
- recognition algorithm
- depth first search
- selection algorithm
- optimization algorithm
- highly efficient
- expectation maximization
- input data
- simulated annealing
- cost function
- significant improvement
- preprocessing
- optimal solution
- similarity measure
- sorting algorithms
- data sets
- tree structure
- neural network