MuRIL: Multilingual Representations for Indian Languages.
Simran Khanuja, Diksha Bansal, Sarvesh Mehtani, Savya Khosla, Atreyee Dey, Balaji Gopalan, Dilip Kumar Margam, Pooja Aggarwal, Rajiv Teja Nagipogu, Shachi Dave, Shruti Gupta, Subhash Chandra Bose Gali, Vish Subramanian, Partha P. Talukdar
Published in: CoRR (2021)
Keyphrases
- indian languages
- cross lingual
- cross lingual information retrieval
- language identification
- document images
- language independent
- text classification
- word segmentation
- language modeling
- news articles
- machine learning
- spoken language
- speaker identification
- multi lingual
- chinese english
- search engine
- information retrieval