CLEU - A Cross-language English-Urdu corpus and benchmark for text reuse experiments.
Iqra Muneer, Muhammad Sharjeel, Muntaha Iqbal, Rao Muhammad Adeel Nawab, Paul Rayson
Published in: J. Assoc. Inf. Sci. Technol. (2019)
Keyphrases
- cross language
- text retrieval
- mono lingual
- cross lingual
- explicit semantic analysis
- plagiarism detection
- question answering
- cross language information retrieval
- character n grams
- open domain
- document collections
- textual and visual information
- parallel corpora
- document retrieval
- query translation
- information access
- information retrieval
- spoken document retrieval
- comparable corpora
- language independent
- parallel corpus
- text categorization
- text corpora
- spontaneous speech
- sentence level
- noun phrases
- sentiment analysis
- machine translation system
- cross language retrieval
- language modeling
- word pairs
- image retrieval
- statistical machine translation
- language identification
- text mining
- chinese english
- qa clef
- machine translation
- transfer learning
- multiword
- natural language processing
- retrieval model
- cl sr
- digital libraries