Cross-Lingual Text Classification of Transliterated Hindi and Malayalam.
Jitin KrishnanAntonios AnastasopoulosHemant PurohitHuzefa RangwalaPublished in: CoRR (2021)
Keyphrases
- cross lingual
- text classification
- indian languages
- bilingual dictionaries
- cross lingual information retrieval
- language independent
- text categorization
- language modeling
- feature selection
- labeled data
- machine learning
- cross language
- bag of words
- text mining
- n gram
- knn
- parallel corpus
- multi lingual
- unlabeled data
- text documents
- translation model
- machine translation
- text classifiers
- artificial intelligence
- statistical machine translation
- information retrieval
- k nearest neighbor
- parallel corpora
- word segmentation
- target language
- semantic features
- query translation
- transfer learning
- image classification