Samayik: A Benchmark and Dataset for English-Sanskrit Translation.
Ayush MaheshwariAshim GuptaAmrith KrishnaAtul Kumar SinghGanesh RamakrishnanAnil Kumar GourishettyJitin SinglaPublished in: LREC/COLING (2024)
Keyphrases
- machine translation
- target language
- source language
- statistical machine translation
- cross language information retrieval
- cross lingual
- parallel corpus
- machine translation system
- query translation
- natural language
- language resources
- natural language processing
- information extraction
- finite state transducers
- parallel corpora
- chinese english
- language independent
- bilingual dictionaries
- word alignment
- machine readable dictionaries
- word sense disambiguation
- english chinese
- word level
- comparable corpora
- statistical translation models
- cross language
- real world
- english language
- cross lingual information retrieval
- feature set
- benchmark datasets
- training dataset
- synthetic datasets