Sign in

A systematic study on parameter correlations in large-scale duplicate document detection.

Shaozhi YeJi-Rong WenWei-Ying Ma
Published in: Knowl. Inf. Syst. (2008)
Keyphrases
  • real world
  • keywords
  • information retrieval systems
  • statistical analysis
  • detection algorithm
  • document images
  • web documents
  • false positives
  • document clustering
  • automatic detection
  • qualitative and quantitative