Developing Monolingual English Corpus for Plagiarism Detection using Human Annotated Paraphrase Corpus.
Salar MohtajHabibollah AsghariVahid ZarrabiPublished in: CLEF (Working Notes) (2015)
Keyphrases
- cross language
- plagiarism detection
- parallel corpus
- statistical machine translation
- manually annotated
- question answering
- cross lingual
- recognizing textual entailment
- cross language information retrieval
- comparable corpora
- machine translation
- multiword
- machine translation system
- parallel corpora
- chinese english
- natural language
- relation extraction
- language independent
- query translation
- information access
- linguistic features
- text categorization
- source code
- domain specific
- information retrieval
- machine learning
- sentence level
- retrieval model
- semantic roles
- structured data
- source language
- query expansion
- probabilistic model