Document Copy Detection System Based on Plagiarism Patterns.
NamOh Kang, Sang-Yong Han
Published in: CICLing (2006)
Keyphrases
- copy detection
- information retrieval
- pattern mining
- document classification
- sequential patterns
- interesting patterns
- pattern discovery
- data mining techniques
- web documents
- semantic information
- similar patterns
- database
- information retrieval systems
- feature selection
- data mining
- neural network
- retrieval systems
- document retrieval
- document collections
- design patterns
- probabilistic model
- keywords
- document representation