UPPC - Urdu Paraphrase Plagiarism Corpus.
Muhammad SharjeelPaul RaysonRao Muhammad Adeel NawabPublished in: LREC (2016)
Keyphrases
- recognizing textual entailment
- sentiment analysis
- plagiarism detection
- sentence level
- manually annotated
- supervised machine learning
- multi lingual
- word level
- test set
- spoken dialog
- n gram
- question answering
- coreference resolution
- natural language text
- text processing
- multiword
- relation extraction
- probabilistic model
- textual entailment
- comparable corpora
- knowledge base
- data sets