XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence.
Ming Zhu, Aneesh Jain, Karthik Suresh, Roshan Ravindran, Sindhu Tipirneni, Chandan K. Reddy
Published in: CoRR (2022)
Keyphrases
- benchmark datasets
- cross-lingual
- machine translation
- cross-lingual information retrieval
- language independent
- language modeling
- text classification
- event extraction
- cross-language
- artificial intelligence
- parallel corpora
- transfer learning
- translation model
- parallel corpus
- learning algorithm
- indian languages
- word sense
- document clustering
- bag of words
- word alignment
- natural language processing
- co-occurrence