Develop corpora and methods for cross-lingual text reuse detection for English Urdu language pair at lexical, syntactical, and phrasal levels.
Iqra MuneerRao Muhammad Adeel NawabPublished in: Lang. Resour. Evaluation (2022)
Keyphrases
- cross lingual
- parallel corpus
- machine translation
- language specific
- natural language processing
- comparable corpora
- natural language
- language independent
- indian languages
- text corpora
- keywords
- multi lingual
- mono lingual
- linguistic resources
- computational linguistics
- language identification
- machine translation system
- latent semantic analysis
- character n grams
- cross language
- bilingual dictionaries
- word sense
- chinese english
- text collections
- cross lingual information retrieval
- european languages