"A Passage to India": Pre-trained Word Embeddings for Indian Languages.
Saurav KumarSaunack KumarDiptesh KanojiaPushpak BhattacharyyaPublished in: CoRR (2021)
Keyphrases
- indian languages
- pre trained
- document images
- language identification
- training data
- cross lingual
- word segmentation
- training examples
- vector space
- question answering
- control signals
- dimensionality reduction
- document image analysis
- decision trees
- english text
- document retrieval
- low dimensional
- prior knowledge
- machine learning
- n gram
- feature space