Using a Dictionary and n-gram Alignment to Improve Fine-grained Cross-Language Plagiarism Detection.
Nava EhsanFrank Wm. TompaAzadeh ShakeryPublished in: DocEng (2016)
Keyphrases
- fine grained
- plagiarism detection
- cross language
- n gram
- language independent
- character n grams
- document retrieval
- language model
- text retrieval
- question answering
- cross lingual
- text classification
- source code
- cross language information retrieval
- information access
- text categorization
- document collections
- duplicate detection
- bag of words
- language modeling
- natural language processing
- parallel corpora
- named entities
- part of speech
- databases
- query expansion
- information retrieval
- data mining