An extended version of sectional MinHash method for near-duplicate detection.
Mohammad Javad Shayegan, Mehdi Faizollahi-Samarin
Published in: J. Supercomput. (2022)
Keyphrases
- experimental evaluation
- fully automatic
- high accuracy
- high precision
- detection method
- main contribution
- data sets
- statistical model
- experimental study
- computationally efficient
- classification accuracy
- optimization method
- multiresolution
- preprocessing
- objective function
- multiscale
- evaluation method
- em algorithm
- synthetic data
- feature set
- significant improvement
- similarity measure