Constructing Multilingual Code Search Dataset Using Neural Machine Translation.
Ryo SekizawaNan DuanShuai LuHitomi YanakaPublished in: CoRR (2023)
Keyphrases
- machine translation
- cross lingual
- language independent
- language resources
- cross language information retrieval
- chinese english
- natural language processing
- multilingual documents
- language specific
- machine translation system
- parallel corpus
- language processing
- statistical machine translation
- natural language generation
- information extraction
- word sense disambiguation
- query translation
- word alignment
- multilingual information retrieval
- cross lingual information retrieval
- comparable corpora
- target language
- bilingual dictionaries
- cross language
- bilingual lexicon
- natural language
- text retrieval
- n gram
- text classification
- digital libraries