Intrinsic Plagiarism Detection Using Character Trigram Distance Scores - Notebook for PAN at CLEF 2011.
Mike KestemontKim LuyckxWalter DaelemansPublished in: CLEF (Notebook Papers/Labs/Workshop) (2011)
Keyphrases
- plagiarism detection
- cross language
- question answering
- document retrieval
- source code
- language model
- text retrieval
- cross lingual
- information access
- cross language information retrieval
- information retrieval
- document collections
- text categorization
- duplicate detection
- query translation
- probabilistic model
- machine learning
- n gram
- query expansion
- domain specific
- case study
- open source
- word segmentation