Paragraph Clustering for Intrinsic Plagiarism Detection using a Stylistic Vector Space Model with Extrinsic Features.
Julian Brooke, Graeme Hirst
Published in: CLEF (Online Working Notes/Labs/Workshop) (2012)
Keyphrases
- vector space model
- plagiarism detection
- document clustering
- clustering algorithm
- information retrieval
- vector space
- clustering method
- semantic information
- k-means
- semantic similarity
- feature vectors
- low level
- knowledge representation
- support vector machine
- text mining
- contextual information
- structural features
- duplicate detection