CharSpan: Utilizing Lexical Similarity to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages.
Kaushal MauryaRahul KejriwalMaunendra Sankar DesarkarAnoop KunchukuttanPublished in: EACL (2) (2024)
Keyphrases
- machine translation
- target language
- machine readable dictionaries
- language independent
- cross lingual
- bilingual dictionaries
- natural language processing
- word sense disambiguation
- statistical machine translation
- word pairs
- language specific
- multilingual documents
- language resources
- machine translation system
- query translation
- source language
- parallel corpora
- cross language information retrieval
- language processing
- information extraction
- comparable corpora
- natural language
- similarity measure
- cross lingual information retrieval
- word alignment
- lexical information
- lexical semantics
- multilingual information retrieval
- linguistic resources
- text summarization
- natural language generation
- pos tagging
- wordnet
- parallel corpus
- chinese english
- cross language
- word order
- natural language text
- semantic role labeling
- question answering
- data mining
- named entity recognition