The method for detecting plagiarism in a collection of documents.
Natalya ShakhovskaIryna ShvorobPublished in: CSIT (2015)
Keyphrases
- computational cost
- cost function
- segmentation method
- pairwise
- document collections
- high accuracy
- similarity measure
- computational complexity
- clustering method
- significant improvement
- high precision
- database
- web documents
- co occurrence
- probabilistic model
- image retrieval
- preprocessing
- information retrieval
- neural network