Towards Document Plagiarism Detection Based on the Relevance and Fragmentation of the Reused Text.
Fernando Sánchez-VegaLuis Villaseñor PinedaManuel Montes-y-GómezPaolo RossoPublished in: MICAI (1) (2010)
Keyphrases
- plagiarism detection
- authorship attribution
- source code
- information retrieval
- duplicate detection
- text documents
- cross language
- keywords
- document retrieval
- retrieved documents
- document images
- relevance feedback
- information retrieval systems
- document classification
- retrieval systems
- relevance model
- semantic information
- document collections
- vector space model
- test collection
- document clustering
- web documents
- text classification
- open source
- text mining
- web pages
- search engine