Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings.
Andrea W. Wen-YiDavid MimnoPublished in: CoRR (2023)
Keyphrases
- cross lingual
- machine translation
- language independent
- cross lingual information retrieval
- language modeling
- vector space
- cross language
- text classification
- low dimensional
- event extraction
- dimensionality reduction
- parallel corpus
- news articles
- query translation
- translation model
- statistical machine translation
- mono lingual
- transfer learning
- distance measure
- parallel corpora
- language model
- indian languages
- knowledge discovery
- machine learning