PPChecker: Plagiarism Pattern Checker in Document Copy Detection.
NamOh Kang, Alexander F. Gelbukh, Sang-Yong Han
Published in: TSD (2006)
Keyphrases
- copy detection
- information retrieval systems
- document images
- document classification
- source code
- multimedia documents
- document collections
- pattern matching
- information retrieval
- tf-idf
- document clustering
- web documents
- retrieval systems
- document retrieval
- open source
- document processing
- data sets
- keywords
- digital documents
- semantic information
- text documents
- probabilistic model
- data structure
- inverted index
- document structure
- plagiarism detection
- pattern detection