Arabic Plagiarism Detection Using Word Correlation in N-Grams with K-Overlapping Approach, Working Notes for PAN-AraPlagDet at FIRE 2015.
Salha AlzahraniPublished in: FIRE Workshops (2015)
Keyphrases
- n gram
- plagiarism detection
- character n grams
- cross language
- language model
- source code
- bag of words
- language independent
- duplicate detection
- word segmentation
- text classification
- language modeling
- variable length
- part of speech
- question answering
- word level
- language specific
- tree kernels
- open source
- cross lingual
- word recognition