Sāmayik: A Benchmark and Dataset for English-Sanskrit Translation.
Ayush Maheshwari, Ashim Gupta, Amrith Krishna, Ganesh Ramakrishnan, G. Anil Kumar, Jitin Singla
Published in: CoRR (2023)
Keyphrases
- machine translation
- target language
- statistical machine translation
- cross language information retrieval
- source language
- query translation
- cross lingual
- machine translation system
- parallel corpus
- natural language processing
- language resources
- information extraction
- Chinese English
- language independent
- benchmark datasets
- word alignment
- natural language
- word level
- parallel corpora
- comparable corpora
- bilingual dictionaries
- cross language retrieval
- database
- mono lingual
- cross lingual information retrieval
- English Chinese
- cross language
- real world
- document images
- feature set
- co occurrence