The method for detecting plagiarism in a collection of documents.

Natalya Shakhovska Iryna Shvorob

Published in: CSIT (2015)

Keyphrases

computational cost
cost function
segmentation method
pairwise
document collections
high accuracy
similarity measure
computational complexity
clustering method
significant improvement
high precision
database
web documents
co occurrence
probabilistic model
image retrieval
preprocessing
information retrieval
neural network