INMT-Lite: Accelerating Low-Resource Language Data Collection via Offline Interactive Neural Machine Translation.
Harshita DiddeeAnurag ShuklaTanuja GanuVivek SeshadriSandipan DandapatMonojit ChoudhuryKalika BaliPublished in: LREC/COLING (2024)
Keyphrases
- machine translation
- data collection
- target language
- language processing
- language specific
- source language
- natural language
- language resources
- machine translation system
- multilingual documents
- parallel corpus
- language independent
- bilingual dictionaries
- cross language information retrieval
- natural language processing
- natural language generation
- parallel corpora
- information extraction
- cross lingual
- comparable corpora
- word sense disambiguation
- chinese english
- phrase based smt
- data analysis
- brazilian portuguese
- word alignment
- word order
- pos tagging
- data mining
- statistical machine translation
- linguistic resources
- machine transliteration
- precision and recall
- query translation