CentralMatch: A Fast and Accurate Method to Identify Blog-Duplicates.
Heejin ParkSang-Chul LeeSoon-Haeng LeeSang-Wook KimPublished in: Web Intelligence (2010)
Keyphrases
- high accuracy
- computationally efficient
- fully automatic
- experimental evaluation
- classification method
- cost function
- dynamic programming
- completely automatic
- similarity measure
- main contribution
- detection method
- synthetic data
- matching algorithm
- high precision
- highly accurate
- support vector machine
- prior knowledge
- mutual information
- objective function
- high quality
- optimization method
- neural network