Siamese-Based Architecture for Cross-Lingual Plagiarism Detection in English-Hindi Language Pairs.
Basant AgarwalMukesh Kumar GuptaHarish SharmaRamesh Chandra PooniaPublished in: Big Data (2023)
Keyphrases
- cross lingual
- cross language
- plagiarism detection
- indian languages
- machine translation
- parallel corpus
- source language
- language specific
- comparable corpora
- target language
- language independent
- cross language information retrieval
- language modeling
- machine translation system
- query translation
- statistical machine translation
- parallel corpora
- bilingual dictionaries
- text classification
- translation model
- natural language
- mono lingual
- source code
- document clustering
- sentiment classification
- pairwise
- word pairs
- word sense disambiguation
- text categorization
- natural language processing
- text retrieval
- question answering
- monolingual retrieval